Preface


This book grew out of the lecture notes we developed for the Oberwolfach Seminar on 
Metric Algebraic Geometry. An Oberwolfach Seminar is essentially a summer school for 
PhD students and postdocs. Ours was held in the week of May 29 to June 2, 2023. Each 
of the three lecturers presented five of the 15 chapters. The lectures were supplemented by 
intense working sessions and inspiring evening discussions. 

While developing our material for the school, we were motivated by the following 
thought. In the early 19th century, there was no difference between algebraic geometry and 
differential geometry. The two were part of the same subject. Geometers studied natural 
properties of curves and surfaces in 3-space, such as curvature, singularities, and defining 
equations. In the 20th century, the threads diverged. The standard curriculum now offers 
algebraic geometry and differential geometry in rather disconnected courses. 

Today, geometry plays an important role in data science, and this requires
us to rethink the schism bemoaned above. Many applied problems center around metric 
questions, such as optimization with respect to distances. These require tools from different 
areas in geometry, algebra and analysis. 

To offer a path towards integration, we propose a circle of ideas which we call metric 
algebraic geometry. This term is a neologism which joins the names metric geometry and 
algebraic geometry. It first appeared in the title of Madeleine Weinstein’s PhD dissertation 
(UC Berkeley 2021). Building on classical foundations, the field embarks towards a new 
paradigm that combines concepts from algebraic geometry and differential geometry, with 
the goal of developing practical tools for the 21st century. 

Many problems in the sciences lead to polynomial equations over the real numbers. The 
solution sets are real algebraic varieties. Understanding distances, volumes and angles — in 
short, understanding metric properties — of those varieties is important for modeling and 
analyzing data. Other metric problems in optimization and statistics involve, for instance, 
minimizing the Euclidean distance from a variety to a given data point. Furthermore, in 
topological data analysis, computing the homology of a submanifold depends on curvature 
and on bottlenecks. Related metric questions arise in machine learning, in the geometry of 
computer vision, in learning varieties from data, and in the study of Voronoi cells. 

This book addresses a wide audience of researchers and students, who will find it useful 
for seminars or self-study. It can serve as the text for a one-semester course at the graduate
level. The key prerequisite is a solid foundation in undergraduate mathematics, especially in
algebra and geometry. Coursework in statistics, computer science, and numerical analysis,
as well as experience with mathematical software, is also helpful.

We hope that you enjoy this book. It will invite you to develop your own perspective, 
and to share it. 


Osnabrück Paul Breiding 
Stockholm Kathlén Kohn 
Leipzig Bernd Sturmfels 


November 2023 
Chapter 1
Historical Snapshot


Throughout this book, we will encounter the interplay of metric concepts with algebraic 
objects. In classical texts such as Salmon’s book [156], metric properties of algebraic 
varieties were essential. This includes the curvature of algebraic curves and computing 
their arc lengths and circumscribed areas using integral calculus. Conversely, many curves 
of interest were defined in terms of distances or angular conditions. This chapter provides 
an introduction to this classical approach to algebraic curves. 

The chapters that follow develop a modern view on metric algebraic geometry. This 
involves algebraic structures in distance minimization (in Chapter 2), optimal transport (in 
Chapter 5), machine learning (in Chapter 10), and computer vision (in Chapter 13). We 
will often use the word model when talking about an algebraic variety because in these 
settings algebraic varieties serve as abstract mathematical models for data. In particular, we 
pose the variety hypothesis: information in data can be described by polynomial equations. 

Our starting point is typically a real algebraic variety X in real affine space $\mathbb{R}^n$. To
utilize methods from algebraic geometry, we pass to the Zariski closure of X in complex
affine space $\mathbb{C}^n$, or in complex projective space $\mathbb{P}^n$. In most cases, to avoid heavy notation,
we use the same symbol for both X and its (projective or affine) complexification. The
metric in metric algebraic geometry enters in the form of a notion of distance. The default
is the distance induced by the standard Euclidean structure on $\mathbb{R}^n$. Over the real numbers,
we use the Euclidean inner product. For $p, q \in \mathbb{R}^n$, this inner product is defined as

$$\langle p, q \rangle = p^\top q = \sum_{i=1}^{n} p_i q_i.$$

This induces the Euclidean norm, which is defined by $\|p\| := \sqrt{\langle p, p \rangle}$. The Euclidean
distance function $d(p, q) := \|p - q\|$ turns $\mathbb{R}^n$ into a metric space. The usual metric in the
complex space $\mathbb{C}^n$ is given by the Hermitian inner product $\langle p, q \rangle_{\mathbb{C}} = p^{*} q$.

However, complex conjugation is not an algebraic operation, and we avoid using it.
Instead, throughout this book we consider the algebraic extension of the Euclidean inner
product and write $\langle p, q \rangle = p^\top q$ also when $p, q \in \mathbb{C}^n$ are complex vectors. This yields a
non-degenerate bilinear form on $\mathbb{C}^n$, which we use to identify $\mathbb{C}^n$ with its dual space. In
this manner, we can consider an algebraic variety and its dual within the same ambient
space. We emphasize that, over the complex numbers, the bilinear form $\langle \cdot, \cdot \rangle$ is not positive
definite, and hence does not induce a metric. For instance, we have $\langle (1, i), (1, i) \rangle = 0$.
Nevertheless, using the algebraic extension of the Euclidean inner product, we will be able
to study metric problems in $\mathbb{R}^n$ using methods from complex algebraic geometry. In some
chapters, we will use other metrics, but this will be spelled out explicitly. For instance,
in Chapter 5 we care about distance functions induced by norms whose unit balls are
polytopes. However, the standard choice of metric is the Euclidean metric above.
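
As a small numerical illustration of the difference between the two forms, the following sketch evaluates both on the vector (1, i). The function names `bilinear` and `hermitian` are our own and purely illustrative.

```python
def bilinear(p, q):
    """Algebraic extension of the Euclidean inner product (no conjugation)."""
    return sum(pi * qi for pi, qi in zip(p, q))


def hermitian(p, q):
    """Hermitian inner product (the first argument is conjugated)."""
    return sum(complex(pi).conjugate() * qi for pi, qi in zip(p, q))


p = (1, 1j)

# 1*1 + i*i: the bilinear form vanishes on this nonzero vector.
isotropic_value = bilinear(p, p)

# |1|^2 + |i|^2: the Hermitian form stays positive.
hermitian_value = hermitian(p, p)
```

The vanishing of `isotropic_value` is exactly the reason why the bilinear form cannot induce a metric on complex vectors, while the Hermitian form can.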

With these 20th-century basics out of the way, we shall now step into the 19th century. 


1.1 Polars 


Our first historical snapshot concerns polars of algebraic curves in the plane. This will lead 
us to metric definitions of special families of curves. 

For a fixed line L in the real plane $\mathbb{R}^2$ with a distinguished point $o \in L$, we can choose
a positive and a negative direction and define a signed Euclidean distance on L. More
concretely, we fix a unit vector v that spans the direction of L. We define

$$\overline{op} := \lambda \in \mathbb{R}, \quad \text{where } p = o + \lambda v \in L.$$

In particular, this definition depends on the line L, the reference point o, and the chosen
direction given by the vector v. The unsigned distance is the Euclidean distance

$$|\overline{op}| = \|o - p\|.$$


According to Salmon [156, §56], the following theorem was first proven by Roger Cotes 
(1682-1716) in his Harmonia Mensurarum. It is valid both over R and over C. 


Theorem 1.1 (Cotes [51]) Fix a point o in the plane and an algebraic curve C of degree d.
Consider any line L through the point o that intersects the curve C in d distinct points
$r_1, \ldots, r_d$. We denote by $p_L$ the point on L whose signed distance to o satisfies

$$\frac{d}{\overline{o p_L}} = \frac{1}{\overline{o r_1}} + \frac{1}{\overline{o r_2}} + \cdots + \frac{1}{\overline{o r_d}}. \tag{1.1}$$

Then, the following subset of the plane is a straight line:

$$K := \bigcup_{L \text{ is a line through } o} \{\, p_L \mid p_L \text{ satisfies Equation (1.1)} \,\}.$$


Salmon called K the polar line of the curve C and the point o. Note that our definition 
of the polar line made use of the Euclidean metric. 


Example 1.2 (d = 2) Consider a conic C and a point o outside of C. Then the polar line is
spanned by the two points on C whose tangent line contains o; see Figure 1.1. ⋄
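
The harmonic-mean relation (1.1) can be checked on a concrete conic. The sketch below is only an illustration under our own assumptions: the unit circle, the point o = (2, 0), and the helper name `cotes_point` are hypothetical choices. It uses Vieta's formulas instead of computing the intersection points, so complex intersections are covered as well. The direction vectors are not normalized; this is harmless because rescaling v rescales all signed distances on L by the same factor, and (1.1) is invariant under such a rescaling.

```python
import sympy as sp

x, y, t = sp.symbols("x y t")


def cotes_point(f, o, v):
    """Point p_L on the line o + t*v that satisfies relation (1.1).

    If t_1, ..., t_d are the parameters of the intersection points, then
    Vieta's formulas give sum(1/t_i) = -c1/c0, where c0 and c1 are the
    constant and linear coefficients of f(o + t*v). Relation (1.1) then
    reads d / t_p = -c1 / c0.
    """
    on_line = {x: o[0] + t * v[0], y: o[1] + t * v[1]}
    g = sp.Poly(f.subs(on_line, simultaneous=True), t)
    coeffs = g.all_coeffs()  # highest degree first
    d, c1, c0 = g.degree(), coeffs[-2], coeffs[-1]
    t_p = -d * c0 / c1
    return (o[0] + t_p * v[0], o[1] + t_p * v[1])


circle = x**2 + y**2 - 1            # hypothetical conic C
o = (sp.Integer(2), sp.Integer(0))  # hypothetical point outside C

directions = [(1, 0), (1, 1), (2, -1), (1, 3)]
points = [cotes_point(circle, o, v) for v in directions]
```

For this choice, all computed points have first coordinate 1/2. The line x = 1/2 also passes through the two points of the circle whose tangent lines contain o, in agreement with Example 1.2. Directions with vanishing linear coefficient c1 (here the vertical direction) send $p_L$ to infinity and are excluded from the sketch.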




Fig. 1.1: A conic and its polar line (blue) with respect to the point o. 


The relation (1.1) means that the signed distance from o to $p_L$ is the harmonic mean of
the signed distances from o to the intersection points $r_i$. We rewrite this as

$$\sum_{i=1}^{d} \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_i}} \right) = 0. \tag{1.2}$$


Suppose now that the point o lies on the line at infinity. Then the point $p_L$ becomes the
average of the points $r_1, r_2, \ldots, r_d$. This is proved in the following corollary.


Corollary 1.3 Let C be a plane curve of degree d and $\mathcal{L}$ a pencil of parallel lines. Let $p_L$
be the average of all (complex) intersection points of the curve C with some line L in $\mathcal{L}$:

$$p_L := \frac{1}{d} \sum_{i=1}^{d} r_i, \quad \text{where } L \cap C = \{r_1, r_2, \ldots, r_d\}. \tag{1.3}$$

Then the set of all points $p_L$, as L ranges over $\mathcal{L}$, forms a straight line in the plane.


Proof We start by fixing a line $L \in \mathcal{L}$. We bring the two fractions in Equation (1.2) to
a common denominator, we use that $\overline{o r_i} - \overline{o p_L} = \overline{o r_i} + \overline{p_L o} = \overline{p_L r_i}$, and we multiply the
result by $\overline{o p_L}$. In this way, we see that (1.2) is equivalent to

$$\sum_{i=1}^{d} \frac{\overline{p_L r_i}}{\overline{o r_i}} = 0. \tag{1.4}$$

In the latter calculation, we kept the same choice of direction on the line L, but we changed
the reference point from o to $p_L$. To investigate what happens in the limit when o goes to
infinity, we fix a point $o_0$ on the line L such that $\lambda_i := \overline{o_0 r_i} > 0$. Let v be the vector defining
the line L and its direction. We can express the points on L using $t \in \mathbb{R}$ as

$$o_t := o_0 - t v.$$

For every fixed $t > 0$, we have $\overline{o_t r_i} = t + \lambda_i$ and we write $p_{L,t}$ for the point $p_L$ that
satisfies (1.4) for $o_t$. Multiplying (1.4) with t, we obtain

$$\sum_{i=1}^{d} \frac{t}{t + \lambda_i} \, \overline{p_{L,t} \, r_i} = 0.$$

In the limit $t \to \infty$, the point $p_{L,t}$ converges to a point $p_L$. We conclude that $\sum_{i=1}^{d} \overline{p_L r_i} = 0$.
In words, the point $p_L$ is the average of the points $r_1, r_2, \ldots, r_d$. For o at infinity, (1.1)
describes the average point $p_L$ in (1.3), and Theorem 1.1 specializes to Corollary 1.3. □


Salmon attributes Corollary 1.3 to Newton, who called the resulting straight line the
diameter of the curve C corresponding to the parallel-lines pencil $\mathcal{L}$ [156, §51]. We point
out that Cotes and Newton knew each other. In fact, Cotes edited the second edition of 
Newton’s Principia before its publication. 

Corollary 1.3 is important in contemporary applied mathematics. It has been extended 
to higher-dimensional varieties and is known as the trace test in numerical algebraic 
geometry [161, Chapter 15.5]. 


Remark 1.4 Here is another delightful theorem on distances of a point to intersection points
between a curve and lines. It was first given by Newton in his Enumeratio Linearum Tertii
Ordinis: For a plane curve C of degree d, a point o in the plane, and two distinct lines
passing through o, consider the ratio

$$\frac{\overline{o r_1} \cdot \overline{o r_2} \cdots \overline{o r_d}}{\overline{o s_1} \cdot \overline{o s_2} \cdots \overline{o s_d}}, \tag{1.5}$$

where $r_1, r_2, \ldots, r_d$ and $s_1, s_2, \ldots, s_d$ are the intersection points of the two lines with C. The
ratio (1.5) is invariant under translating the point o and the two intersecting lines [156, §46].


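In [156, §57], Salmon extends the construction of polar lines to polar curves of higher
order. Using notation as in Theorem 1.1, he shows that the locus of points $p_L$ satisfying

$$\sum_{1 \le i < j \le d} \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_i}} \right) \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_j}} \right) = 0$$

instead of (1.1) is a conic, called the polar conic of the curve C and the point o. If o is at
infinity, the polar conic is also referred to as the diametral conic. More generally, the locus
of points $p_L$ satisfying

$$\sum_{1 \le i_1 < i_2 < \cdots < i_k \le d} \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_{i_1}}} \right) \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_{i_2}}} \right) \cdots \left( \frac{1}{\overline{o p_L}} - \frac{1}{\overline{o r_{i_k}}} \right) = 0$$

is the polar curve of order k associated with the curve C and the point o [156, §58]. If o
is at infinity, that polar curve is called the curvilinear diameter of order k. In conclusion,
the 17th-century definition of polar curves has a metric origin. Measuring distances plays
a crucial role in what we have seen so far.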
In the 20th-century algebraic geometry literature, on which our Chapter 4 on polar
varieties will rest, the definition of polar curves makes no reference to any metric. Namely,
polar curves are characterized without involving distances, as follows.

The defining equation of the curve C in homogeneous coordinates equals f(x, y, z) = 0,
where f has degree d := deg C. For o = (a : b : c), we define the differential operator

$$\Delta_o := a \frac{\partial}{\partial x} + b \frac{\partial}{\partial y} + c \frac{\partial}{\partial z}. \tag{1.6}$$

Then $\Delta_o^{d-k} f$ is a homogeneous polynomial of degree k in x, y, z. This is the defining
equation of the polar curve to C of order k. This was proved by Salmon in [156, §63].

By exploiting symmetry in Taylor series, Salmon also shows the following beautiful
duality between polar curves that “may be written at pleasure” [156, §63]. Denoting by p
the point with homogeneous coordinates p = (x : y : z), Salmon’s result states:

$$\frac{1}{(d-k)!} \, \Delta_o^{d-k} f(x, y, z) = \frac{1}{k!} \, \Delta_p^{k} f(a, b, c).$$


In particular, the polar curve of order k that is associated with the degree d curve C and 
the point o is the locus of all points p such that the polar curve of order d — k associated 
with C and p passes through o. 
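
The duality between polar curves can be verified symbolically on an example. The following sketch assumes a specific cubic (the homogenization of the cubic that appears later in Example 1.9) and uses helper names `delta`, `at` and `duality_gap` that are ours, not Salmon's. The coordinates (u, v, w) stand for the point p, to keep them apart from the differentiation variables x, y, z.

```python
import sympy as sp

x, y, z, a, b, c, u, v, w = sp.symbols("x y z a b c u v w")
X, o, p = (x, y, z), (a, b, c), (u, v, w)


def delta(g, point, times=1):
    """Apply the operator sum(point_i * d/dX_i) to g the given number of times."""
    for _ in range(times):
        g = sum(q * sp.diff(g, var) for q, var in zip(point, X))
    return sp.expand(g)


def at(g, point):
    """Evaluate g(x, y, z) at the given point."""
    return g.subs(dict(zip(X, point)), simultaneous=True)


# Assumed example: a homogeneous cubic, so d = 3.
f = x**3 + y**3 + 5 * x**2 * z - 2 * x * y * z + x * z**2 - z**3
d = 3


def duality_gap(k):
    """Difference of the two sides of the duality formula for the given order k."""
    lhs = at(delta(f, o, d - k), p) / sp.factorial(d - k)
    rhs = at(delta(f, p, k), o) / sp.factorial(k)
    return sp.expand(lhs - rhs)


gaps = [duality_gap(k) for k in range(d + 1)]
```

All entries of `gaps` vanish identically in a, b, c, u, v, w, which is the statement that both sides are the same coefficient in the Taylor expansion of f along the line spanned by o and p.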


1.2 Foci 


“... we believe that it will be found that every point which has any special relation to any 
curve will be found either to be a singular point of the curve, or a focus of it” 


(George Salmon [156, §125]) 


An ellipse is the set of points in $\mathbb{R}^2$ whose sum of distances to two fixed points is constant.
The two points are the foci of the ellipse. In 1832, Plücker generalized the definition of
foci to arbitrary plane curves, in the manner described below. Consider a circle in the
plane. If we embed the affine plane into the projective plane $\mathbb{P}^2$ by sending $(x, y) \in \mathbb{C}^2$ to
(x : y : 1), then every circle passes through the two circular points at infinity $(1 : i : 0)$
and $(1 : -i : 0)$. In fact, passing through both circular points characterizes circles.


Lemma 1.5 An irreducible quadratic curve is a circle if and only if it passes through the 
two circular points at infinity. 


Proof Let $f(x, y, z) = (x - az)^2 + (y - bz)^2 - r^2 z^2$ be the polynomial defining a circle.
Then, $f(1 : \pm i : 0) = 1^2 + (\pm i)^2 = 0$. This shows one direction. For the other direction, let
$f = a_1 x^2 + a_2 xy + a_3 y^2 + a_4 xz + a_5 yz + a_6 z^2$ be an arbitrary real quadric. If f passes through
the circular points, then $0 = f(1 : i : 0) = a_1 + a_2 i - a_3$ and $0 = f(1 : -i : 0) = a_1 - a_2 i - a_3$.
This implies $a_1 = a_3$ and $a_2 = 0$. Hence, f defines a circle in the affine plane {z = 1}. □


The following definition goes back to Plücker [147]. 
Definition 1.6 Consider a plane curve C. A point f in the plane is a focus of the curve C if
both lines spanned by the point f and the circular points at infinity are tangent to C.


Fig. 1.2: The real foci of an ellipse with width 2a and height 2b (a > b) and its axes aligned with the
x-axis and y-axis of the real affine plane are $(\pm\sqrt{a^2 - b^2}, 0)$.


Example 1.7 We determine the foci (in the sense of Plücker’s Definition 1.6) of an ellipse.
Since the condition that a line in $\mathbb{P}^2$ passes through the point $(1 : \pm i : 0)$ is invariant under
translations and rotations of the real affine plane $\{z \neq 0\} \subset \mathbb{P}^2$, we may translate and rotate
the ellipse such that its two defining foci (whose sum of distances is constant along the
ellipse) become $(c, 0)$ and $(-c, 0)$ in the real affine plane (with c > 0). Now the ellipse is
the locus of points (x, y) that satisfy, for some constant a > 0, the equation

$$\sqrt{(x - c)^2 + y^2} + \sqrt{(x + c)^2 + y^2} = 2a.$$

From this, we see that the two points $(\pm a, 0)$ lie on the ellipse. Moreover, the ellipse
intersects the y-axis at $(0, \pm b)$ such that $b^2 = a^2 - c^2$; see Figure 1.2. Hence, the width and
height of the ellipse are 2a and 2b, respectively (with a > b > 0), and its defining equation
can be written as

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1.$$

If the ellipse is not a circle (i.e., a > b), then it has two tangent lines that pass through
the circular point at infinity $(1 : i : 0)$, namely the tangent lines at $p_+ := (a^2 : ib^2 : c)$
and $p_- := (-a^2 : -ib^2 : c)$. The tangent lines at the complex conjugate points $\overline{p_+}$ and $\overline{p_-}$
contain the other circular point $(1 : -i : 0)$. The foci à la Plücker are the four points
of intersection of the two tangent lines through $(1 : i : 0)$ with the two tangent lines
through $(1 : -i : 0)$. Since the tangent lines at $p_\pm$ and $\overline{p_\pm}$ are complex conjugates, they
meet at a real point, namely $(\pm c : 0 : 1)$. This shows that the real foci we used to define
the ellipse are indeed foci in the sense of Plücker’s Definition 1.6. The other two foci are
obtained by intersecting the tangent line at $p_\pm$ with the tangent line at $\overline{p_\mp}$. They are the
imaginary points $(0 : \mp ic : 1)$.

If the ellipse is a circle (i.e., a = b), then it passes through the two circular points at
infinity. The tangent lines at those points are the only ones that pass through the circular
points at infinity. They are given by $x \pm iy = 0$ and intersect at the origin (0 : 0 : 1). Thus,
in this case, the four foci coincide, and they all lie at the center of the circle. ⋄
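
The tangency claim of Example 1.7 can be checked symbolically. The sketch below is an illustration under our own conventions: it restricts the ellipse equation (with denominators cleared, and with $b^2$ replaced by $a^2 - c^2$) to the line through the focus (c, 0) and the circular point (1 : i : 0), and tests whether the resulting quadratic in the line parameter t has a double root.

```python
import sympy as sp

a, c = sp.symbols("a c", positive=True)
t = sp.Symbol("t")
b2 = a**2 - c**2  # b^2 = a^2 - c^2

# Affine points on the line through (c, 0) with direction (1, i).
px, py = c + t, sp.I * t

# Ellipse equation x^2/a^2 + y^2/b^2 = 1 with denominators cleared.
g = sp.Poly(sp.expand(b2 * px**2 + a**2 * py**2 - a**2 * b2), t)

# A vanishing discriminant means the line meets the ellipse in a double point.
disc = g.discriminant()

# Parameter of the double root and the corresponding point of tangency.
A, B, _ = g.all_coeffs()
t0 = sp.simplify(-B / (2 * A))
touch = (sp.simplify(px.subs(t, t0)), sp.simplify(py.subs(t, t0)))
```

Under these assumptions the discriminant vanishes, and `touch` equals $(a^2/c, \, i b^2/c)$, which is the affine form of the point $p_+ = (a^2 : ib^2 : c)$ from the example.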


In general, the number of foci of an algebraic plane curve depends on its class, that is,
the degree of its dual curve. The dual projective plane is the set of lines in the original
projective plane $\mathbb{P}^2$. The dual curve $C^\vee$ of a plane curve $C \subset \mathbb{P}^2$ is the Zariski closure in
the dual projective plane of the set of tangent lines $T_x C$ at regular points x of C. Hence, the
degree of the dual curve $C^\vee$ (equivalently, the class of C) is the number of tangent lines
to C that pass through a generic point in the plane $\mathbb{P}^2$. Using the usual Euclidean inner
product $\langle \cdot, \cdot \rangle$, we can view the dual curve as a curve in $\mathbb{P}^2$. More specifically, the dual is

$$C^\vee = \overline{\{\, y \mid \text{there is a regular point } x \in C \text{ with } \langle y, v \rangle = 0 \text{ for all } v \in T_x C \,\}} \subset \mathbb{P}^2. \tag{1.7}$$

This means that $C^\vee$ is the Zariski closure of the set of points representing normal lines
to C. Here, orthogonality is measured by $\langle \cdot, \cdot \rangle$. If C is defined by the polynomial f and we
have a point $x \in C$, then the normal line of C at x is spanned by the gradient

$$\nabla f(x) = \left( \partial f / \partial x_1 (x), \; \partial f / \partial x_2 (x) \right)^\top.$$

Consequently,

$$C^\vee = \overline{\{\, \nabla f(x) \mid x \text{ is a regular point of } C \,\}}.$$


Suppose now that the curve C has class m. Then, through each of the two circular 
points at infinity, there are m tangent lines to the curve C. There are $m^2$ intersection points
of these two sets of m lines. Furthermore, if the curve C is real then the intersection 
points of conjugate pairs of tangent lines are also real. This leads to the following result; 
see [156, §125]. The case m = 2 was featured in Example 1.7. 


Proposition 1.8 A curve C of class m has $m^2$ complex foci (counted with multiplicity).
When the curve C is real, exactly m foci are real. 


In the remainder of this section, we shall examine the foci for additional classes of curves. 
There are several constructions that generalize ellipses in an obvious way. For instance, an 
n-ellipse is the locus of points in a plane whose sum of distances to n fixed points in the 
plane is constant. The class of n-ellipses was studied by Tschirnhaus in 1686 [168] and 
Maxwell in 1846 [129]. As we defined them, n-ellipses are not algebraic, but semialgebraic. 
In fact, they are boundaries of planar spectrahedra [137]. 

In what follows, we focus on an alternative generalization. Instead of considering the 
sum of distances to foci, we consider a weighted sum or product. This leads us to Cartesian 
ovals and Cassini ovals, respectively. Cartesian ovals are named after Descartes, who
first studied them in his 1637 La Géométrie for their application to optics. Such a curve is the
locus of points in a plane whose weighted sum of distances to two fixed points is constant,
i.e., it is the locus of points p that satisfy the metric condition

$$\|p - f_1\| + s \|p - f_2\| = r,$$

where $f_1$ and $f_2$ are fixed points in the plane and s, r are fixed constants. The Cartesian oval
satisfies an equation of degree four. Hence the Zariski closure of the Cartesian oval is a
quartic curve. The real part of this quartic curve consists of two nested ovals; see Figure 1.3.
Of the four equations

$$\|p - f_1\| \pm s \|p - f_2\| = \pm r, \tag{1.8}$$

exactly two have real solutions and those describe the ovals. Salmon shows that a quartic
curve is a Cartesian oval if and only if it has cusps at the two circular points at infinity;
see [156, §129]. By Plücker’s formula [156, §72], it follows that the class of such a quartic
is six (except in degenerate cases). Salmon determines the six real foci of Cartesian oval
quartic curves [156, §129] (see also Basset [13, §273]): Three of them form a triple focus
which is located at the intersection of the cusps’ tangent lines. The remaining three foci lie
on a straight line. Two of them are the points $f_1$ and $f_2$ that define the curve in (1.8). It was
already observed by Chasles in [45, Note XXI] that any two of the three single foci can be
used to define the Cartesian oval by an equation of the form (1.8). If two of the three single
foci come together, the Cartesian oval degenerates to a limaçon of Pascal; see Figure 1.3.
Salmon further observes another interesting metric property of foci: whenever a line meets
a Cartesian oval in four points, the sum of the four distances from any of the three single
foci is constant [156, §218].

Fig. 1.3: Cartesian ovals with defining foci $f_1 = (0, 0)$, $f_2 = (1, 0)$ and weight s = 2 are described by the
equation $(-3(x^2 + y^2) + 8x + r^2 - 4)^2 - 4r^2 (x^2 + y^2) = 0$. The left picture shows the Cartesian oval for
r = 3 and the right picture shows it for r = 2. The foci are the red points. In both cases, the focus at (4/3, 0)
has order three. In the right picture, the focus at (0, 0) has order two. This gives six real foci in total.
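
The quartic equation quoted in the caption of Figure 1.3 can be recovered by multiplying the four factors that correspond to the sign choices in the metric condition. With $u = \|p - f_1\|$ and $v = \|p - f_2\|$, that product equals $(u^2 + s^2 v^2 - r^2)^2 - 4 s^2 u^2 v^2$, which only involves the squared distances. The following sketch, with variable names of our own choosing, confirms that this agrees with the caption's equation for $f_1 = (0, 0)$, $f_2 = (1, 0)$ and s = 2.

```python
import sympy as sp

x, y, r = sp.symbols("x y r", real=True)

U = x**2 + y**2        # squared distance to f1 = (0, 0)
V = (x - 1)**2 + y**2  # squared distance to f2 = (1, 0)
s = 2                  # weight used in Figure 1.3

# Product of the four factors (u +/- s*v +/- r), expressed through U and V.
product = (U + s**2 * V - r**2)**2 - 4 * s**2 * U * V

# Equation from the caption of Figure 1.3.
quartic = (-3 * (x**2 + y**2) + 8 * x + r**2 - 4)**2 - 4 * r**2 * (x**2 + y**2)

difference = sp.expand(product - quartic)

# Sanity check: for r = 3 the point (5/3, 0) satisfies |p - f1| + 2 |p - f2| = 3.
value_on_oval = quartic.subs({x: sp.Rational(5, 3), y: 0, r: 3})
```

Both `difference` and `value_on_oval` are zero, so the quartic is the product of the four equations (1.8) and contains the real ovals.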

A Cassini oval (named after Cassini who studied them in 1693 [40]) is the locus of points
in a plane whose product of distances to two fixed points $f_1$ and $f_2$ is a fixed constant r.
Hence the Cassini oval is an algebraic curve defined by the real quartic equation

$$\|p - f_1\|^2 \cdot \|p - f_2\|^2 = r^2.$$
Fig. 1.4: The Cassini ovals in Equation (1.9) with defining foci $f_1 = (-2, 0)$ and $f_2 = (2, 0)$. All foci are
shown in red. The pictures display the cases r = 3 (left), r = 4 (middle) and r = 5 (right). The outer foci
on the x-axis all have multiplicity 3. The middle focus in the second picture has multiplicity 2.


The circular points at infinity are double points on each Cassini oval. Since these are 
typically the only singularities, the class of a Cassini oval is eight in general [156, §219]. 

Basset [13, §247] explains that the two pairs of complex conjugate tangent lines at the
two nodes intersect at $f_1$ and $f_2$, respectively. Hence, those points are foci in the sense of
Plücker’s Definition 1.6. Moreover, Basset shows that each of them is a triple focus (the
reason being that the nodal tangents are stationary). To describe the remaining two real
foci, we translate and rotate (as in Example 1.7) such that $f_1 = (c, 0)$ and $f_2 = (-c, 0)$.
Then, the Cassini oval is defined by the quartic equation

$$\left( (x - c)^2 + y^2 \right) \left( (x + c)^2 + y^2 \right) = r^2. \tag{1.9}$$

If $r < c^2$, then the real locus consists of two ovals. Otherwise, the real locus is connected,
where the degenerate case $r = c^2$ is the lemniscate of Bernoulli; see Figure 1.4. In the case
of two ovals, the remaining two real foci also lie on the x-axis; they are $(\pm\frac{1}{c}\sqrt{c^4 - r^2}, 0)$. If
$r > c^2$, the two foci are $(0, \pm\frac{1}{c}\sqrt{r^2 - c^4})$ and lie on the y-axis. In the degenerate case $r = c^2$,
they become a double focus at the origin.
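
The location of the foci of the Cassini oval (1.9) can be checked with a short symbolic computation. The sketch below uses the coordinates $u = x + iy$ and $w = x - iy$, which are our own choice for this illustration. Lines through the circular point (1 : i : 0) are the lines u = k, and such a line is tangent to the curve when the restricted equation has a double root in w.

```python
import sympy as sp

x, y, u, w, k = sp.symbols("x y u w k")
c, r = sp.symbols("c r", positive=True)

cassini = ((x - c)**2 + y**2) * ((x + c)**2 + y**2) - r**2

# Change of coordinates u = x + i*y, w = x - i*y.
to_uw = {x: (u + w) / 2, y: (u - w) / (2 * sp.I)}
in_uw = sp.expand(cassini.subs(to_uw, simultaneous=True))
factored_form = (u**2 - c**2) * (w**2 - c**2) - r**2
coordinate_gap = sp.expand(in_uw - factored_form)

# Restrict to the line u = k and ask for a double root in w.
disc = sp.discriminant(in_uw.subs(u, k), w)

# Candidate tangent lines: u = +/- c and u = +/- sqrt(c^4 - r^2) / c.
k_focus = sp.sqrt(c**4 - r**2) / c
disc_at_focus = sp.simplify(disc.subs(k, k_focus))
disc_at_node = sp.simplify(disc.subs(k, c))
```

Both discriminant values vanish. Intersecting a tangent line u = k with its complex conjugate $w = \bar{k}$ gives a real point: for real k it is (k, 0) on the x-axis, and for purely imaginary k it lies on the y-axis. This matches the case distinction between $r < c^2$ and $r > c^2$ above.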

In conclusion, the foci of a plane curve are landmark points of a metric origin. Starting 
from these foci, one obtains interesting classical curves, like Cassini ovals and Cartesian 
ovals. These curves are gems in the repertoire of metric algebraic geometry. 


1.3 Envelopes 


Given a one-dimensional algebraic family of lines in the plane, its envelope is a curve such 
that each of the given lines is tangent to the curve. We can view the family of lines as an 
algebraic curve $\mathcal{L}$ in the dual projective plane. The envelope is the dual curve $\mathcal{L}^\vee$, which
now lives in the primal projective plane. This is a first instance of the biduality, which we 
shall see more generally in Equation (2.15) and Theorem 4.11. 


Example 1.9 The diameters of a plane curve C form a one-dimensional family of lines.
Indeed, there is one such diameter for each point on the line at infinity; cf. Corollary 1.3.
For a cubic curve C, the envelope of that family is the locus consisting of the centers
of the diametral conics of C [156, §160]. For instance, for the cubic curve defined by
$x^3 + y^3 + 5x^2 - 2xy + x - 1 = 0$, the diameter associated with the point (a : b : 0) is the line

$$3a^2 x + 3b^2 y + 5a^2 - 2ab = 0.$$

The family of all diameters is the curve $\mathcal{L}$ defined by $25x^2 - 4xy - 30x + 9 = 0$. Its dual
curve $\mathcal{L}^\vee$, that is the envelope of the diameters, is $9xy + 15y - 1 = 0$. The diametral conic
associated with (a : b : 0) is given by the equation

$$3a \left( x + \frac{5a - b}{3a} \right)^2 + 3b \left( y - \frac{a}{3b} \right)^2 + a - \frac{(5a - b)^2}{3a} - \frac{a^2}{3b} = 0.$$

Its center $\left( \frac{b - 5a}{3a}, \frac{a}{3b} \right)$ lies on the envelope. In fact, the tangent line of the envelope at that
point is the diameter associated with (a : b : 0); see Figure 1.5. ⋄

Fig. 1.5: The cubic curve $x^3 + y^3 + 5x^2 - 2xy + x - 1 = 0$ (blue) and the envelope $9xy + 15y - 1 = 0$ (red)
of its diameters. The diameter (green) and diametral conic (yellow) are associated with the point (2 : 1 : 0)
at infinity. Their equations are given in Example 1.9.
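
The claims of Example 1.9 can be reproduced with the differential operator (1.6). The following sketch is an illustration under our own naming: `polar` applies $\Delta_o$ for the point o = (a : b : 0) to the homogenized cubic, once for the diametral conic and twice for the diameter. It then checks that the line coordinates of the diameter satisfy the equation of the family of diameters and that the center of the diametral conic lies on the envelope with the diameter as tangent line.

```python
import sympy as sp

x, y, z, a, b = sp.symbols("x y z a b")

# Homogenization of the cubic from Example 1.9.
F = x**3 + y**3 + 5 * x**2 * z - 2 * x * y * z + x * z**2 - z**3


def polar(g):
    """Apply the operator (1.6) for the point o = (a : b : 0)."""
    return a * sp.diff(g, x) + b * sp.diff(g, y)


conic = sp.expand(polar(F))               # diametral conic
diameter = sp.expand(polar(conic) / 2)    # diameter, a linear form in x, y, z

# Line coordinates of the diameter and the equation of the family of diameters.
lx, ly, lz = (diameter.coeff(var) for var in (x, y, z))
family_value = sp.expand(25 * lx**2 - 4 * lx * ly - 30 * lx * lz + 9 * lz**2)

# Center of the diametral conic in the affine chart z = 1.
conic_affine = conic.subs(z, 1)
center = sp.solve([sp.diff(conic_affine, x), sp.diff(conic_affine, y)], [x, y], dict=True)[0]

envelope = 9 * x * y + 15 * y - 1
on_envelope = sp.simplify(envelope.subs(center))
on_diameter = sp.simplify(diameter.subs(z, 1).subs(center))

# The gradient of the envelope at the center is parallel to the normal of the diameter.
gx, gy = sp.diff(envelope, x).subs(center), sp.diff(envelope, y).subs(center)
tangent_gap = sp.simplify(gx * ly - gy * lx)
```

All four quantities `family_value`, `on_envelope`, `on_diameter` and `tangent_gap` vanish identically in a and b (with a, b nonzero so that the center is defined).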


Evolutes and caustics are instances of envelopes defined by metric properties. The 
evolute of a plane curve C is the envelope of its normals (i.e., the lines orthogonal to 
its tangent lines). Equivalently, the evolute is the locus of the centers of curvature (see 
Proposition 6.2). The study of evolutes goes back to Apollonius (ca. 200 BC); see [172]. 
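
Both descriptions of the evolute can be compared on a simple example. The sketch below assumes the parabola $y = x^2$, which is our own choice and not a general curve in the sense of the degree formulas in this section. It computes the envelope of the normal lines and the centers of curvature from the standard formula for a parametrized plane curve, and compares the two.

```python
import sympy as sp

t, X, Y = sp.symbols("t X Y", real=True)

# Assumed example curve: the parabola y = x^2, parametrized by (t, t^2).
px, py = t, t**2
dx, dy = sp.diff(px, t), sp.diff(py, t)    # tangent direction
ddx, ddy = sp.diff(dx, t), sp.diff(dy, t)  # second derivatives

# Normal line at (px, py): all (X, Y) whose offset from the point is orthogonal to the tangent.
normal = (X - px) * dx + (Y - py) * dy

# Envelope of the family of normals: normal = 0 and d(normal)/dt = 0.
envelope = sp.solve([normal, sp.diff(normal, t)], [X, Y], dict=True)[0]

# Center of curvature of a parametrized plane curve.
speed2 = dx**2 + dy**2
denom = dx * ddy - dy * ddx
center = (px - dy * speed2 / denom, py + dx * speed2 / denom)

# Implicit equation of the resulting curve (a semicubical parabola).
implicit = 27 * X**2 - 16 * (Y - sp.Rational(1, 2))**3
implicit_on_envelope = sp.expand(implicit.subs(envelope))
```

In this example the envelope of the normals is parametrized by $(-4t^3, \, 3t^2 + \tfrac{1}{2})$, which coincides with the centers of curvature and satisfies $27X^2 = 16(Y - \tfrac{1}{2})^3$.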

A recent study of evolutes can be found in [146]. The degree of the evolute of a
general smooth curve of degree d is 3d(d − 1) (see Corollary 6.9). The class of this
evolute equals $d^2$, by [156, §116]. Moreover, the evolute of the general degree d curve has
$\frac{d}{2}(3d - 5)(3d^2 - d - 6)$ double points, 3d(2d − 3) cusps, and it has no other singularities.

The evolute of the Cartesian oval in Figure 1.3 is illustrated in Figure 1.6. Its equation is

$$
\begin{aligned}
& 102036672 x^{10} y^2 + 433655856 x^8 y^4 + 833407380 x^6 y^6 + 917059401 x^4 y^8 \\
& + 558336726 x^2 y^{10} + 143065521 y^{12} - 884317824 x^9 y^2 - 3106029888 x^7 y^4 \\
& - 4898885832 x^5 y^6 - 4008450240 x^3 y^8 - 1331276472 x y^{10} + 3251316672 x^8 y^2 \\
& + 9515584512 x^6 y^4 + 12088352844 x^4 y^6 + 6432939486 x^2 y^8 + 620191890 y^{10} \\
& - 40310784 x^9 - 6758774784 x^7 y^2 - 16647933888 x^5 y^4 - 15962551632 x^3 y^6 \\
& - 4237194240 x y^8 + 342641664 x^8 + 9145229184 x^6 y^2 + 18728830368 x^4 y^4 \\
& + 11743648812 x^2 y^6 + 961612425 y^8 - 1239556608 x^7 - 9234062208 x^5 y^2 \\
& - 14497919136 x^3 y^4 - 4640798304 x y^6 + 2495722752 x^6 + 8064660672 x^4 y^2 \\
& + 8003654064 x^2 y^4 + 835700656 y^6 - 3071831040 x^5 - 6288399360 x^3 y^2 \\
& - 2974296960 x y^4 + 2390342400 x^4 + 3772699200 x^2 y^2 + 540271200 y^4 \\
& - 1173312000 x^3 - 1396800000 x y^2 + 349920000 x^2 + 228000000 y^2 \\
& - 57600000 x + 4000000 = 0.
\end{aligned}
$$

Fig. 1.6: The Cartesian oval from Figure 1.3 (left) together with its evolute.


We will see in Proposition 6.5 that the finite cusps of the evolute correspond to the points 
of critical curvature of the curve C. Salmon computes the length of an arc of the evolute 
as “the difference of the radii of curvature at its extremities” [156, §115]. The converse 
operation to computing the evolute is finding an involute; that is, for a given plane curve C, 
find a curve whose evolute is C. Involutes of an algebraic curve are typically not unique 
and they might not be algebraic. For instance, the involute of a circle is a transcendental 
curve [156, §235]. The nonuniqueness of involutes can be seen from the offset curves in 
Section 7.2. Any two offset curves of a given curve have the same evolute. Finally, we note 
that the foci of a plane curve are also foci of its evolute and its involutes [156, §127]. 

We conclude with caustics of plane curves. These come in two flavors. Let us imagine 
that a fixed point r in the plane emits light. The light rays are reflected at each point of 
a given plane curve C. The caustic by reflection is the envelope of the family of reflected 
rays. Similarly, the caustic by refraction is the envelope of the family of refracted rays. At 
a point p of the given curve C, the refracted ray is defined as follows: If $\alpha_1$ is the angle
that the light ray from the radiant point r to p makes with the normal of C at p and $\alpha_2$ is
the angle between the refracted ray and the same normal, then the ratio $\sin(\alpha_1) / \sin(\alpha_2)$ is a constant
independent of p, called the refraction constant. Figure 1.7 shows both caustics for a circle
where the radiant point is at infinity. Those curves can be commonly observed in real life,
e.g., when the sun shines on a round glass.

Fig. 1.7: Caustics by reflection (left) and refraction (right) of a circle with light source at infinity.


Example 1.10 Caustics by refraction of circles are evolutes of Cartesian ovals. This result
is due to Quetelet [156, §124]. For instance, the evolute of the Cartesian oval in Figures 1.3
(left) and 1.6 is the caustic by refraction of the circle with radius 3 which is centered at the
triple focus (4/3, 0). Here, the radiant point is $f_1 = (0, 0)$ and the refraction constant is Z. ⋄


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International 
License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and 
reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the 
source, provide a link to the Creative Commons license and indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s Creative Commons 
license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter’s 
Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the 
permitted use, you will need to obtain permission directly from the copyright holder. 


Chapter 2
Critical Equations

We consider a model X that is given as the zero set in $\mathbb{R}^n$ of a collection $\{f_1, \ldots, f_k\}$ of
polynomials in n unknowns $x_1, \ldots, x_n$. Thus, X is a real algebraic variety. In algebraic
geometry, it is preferable to work with the complex algebraic variety defined by the same
polynomials. This is the Zariski closure, and we will use the same symbol: $X \subset \mathbb{C}^n$. We
assume that X is irreducible, that $I_X = \langle f_1, \ldots, f_k \rangle$ is its prime ideal, and that the set of
nonsingular real points is Zariski dense in X. The $k \times n$ Jacobian matrix $J = (\partial f_i / \partial x_j)$
has rank at most c at any point $x \in X$, where c = codim(X). The point x is nonsingular on
X if the rank is exactly c. The variety X is called smooth if all its points are nonsingular.
Elaborations on these hypotheses are found in many textbooks, including [133, Chapter 2].

The following optimization problem arises in many applications, and we shall revisit it 
again and again throughout this book. Given a data point $u \in \mathbb{R}^n \setminus X$, compute the distance
to the model X. Thus, we seek a point x* in X that is closest to u. The answer depends on the 
chosen metric. We focus on least-squares problems, where the metric is a Euclidean metric. 
Furthermore, we assume that the optimal point x* is smooth on X. One can compute x* by 
solving the critical equations. In optimization, these are known as first-order conditions or 
KKT equations, and they arise from introducing Lagrange multipliers. We seek to compute 
all complex solutions to the critical equations. The set of these critical points is typically 
finite, and it includes all local maxima, all local minima, and all saddle points. 
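
The Lagrange multiplier viewpoint can be made explicit in a short sketch. We assume here, for
simplicity, that $X$ is a complete intersection ($k = c$) and that $x$ is a nonsingular point;
the multipliers $y_1, \ldots, y_c$ and the factor $\tfrac{1}{2}$ are notation chosen for this
illustration.

```latex
\[
  L(x, y) \;=\; \tfrac{1}{2}\,\|x - u\|^2 \;+\; \sum_{i=1}^{c} y_i\, f_i(x),
\]
\[
  \nabla_x L \;=\; (x - u) \;+\; \sum_{i=1}^{c} y_i\, \nabla f_i(x) \;=\; 0,
  \qquad
  f_1(x) \;=\; \cdots \;=\; f_c(x) \;=\; 0 .
\]
```

The first condition says that the vector $x - u$ lies in the row space of the Jacobian matrix
$\mathcal{J}$ at $x$. Eliminating the multipliers therefore turns the first-order conditions
into a rank condition on the matrix obtained by stacking $x - u$ on top of $\mathcal{J}$.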


2.1 Euclidean Distance Degree 


We begin by discussing the Euclidean distance (ED) problem, which is as follows: 


$$\text{minimize } \|x - u\|^2 = \sum_{i=1}^{n} (x_i - u_i)^2 \quad \text{subject to } x = (x_1, \ldots, x_n) \in X. \tag{2.1}$$


This optimization problem is defined over the real locus of $X$. We study it by the strategy
that we announced above: we first derive the critical equations for (2.1), and then study
them over the complex variety $X$. Thus, we interpret the optimization problem (2.1) as a
problem of solving polynomial equations over $\mathbb{C}$.

The augmented Jacobian matrix $\mathcal{AJ}$ is the $(k+1) \times n$ matrix which is obtained by
placing the row vector $(x_1 - u_1, \ldots, x_n - u_n)$ atop the Jacobian matrix $\mathcal{J}$.
We form the ideal generated by the $(c+1) \times (c+1)$ minors of $\mathcal{AJ}$, we add the
ideal of the model $I_X$, and we then saturate that sum by the ideal of $c \times c$ minors of
$\mathcal{J}$. See [60, Equation (2.1)]. The result is the critical ideal $\mathcal{C}_{X,u}$ of
the model $X$ with respect to the data point $u$.


Example 2.1 (Plane curves) Let $X$ be the plane curve defined by a polynomial $f(x_1, x_2)$.
We wish to compute the Euclidean distance from $X$ to a given point $u = (u_1, u_2) \in \mathbb{R}^2$.
To this end, we form the augmented Jacobian matrix. This matrix is square of size $2 \times 2$:

$$\mathcal{AJ} = \begin{pmatrix} x_1 - u_1 & x_2 - u_2 \\ \partial f/\partial x_1 & \partial f/\partial x_2 \end{pmatrix}. \tag{2.2}$$

We get the critical ideal from $f$ and the determinant of $\mathcal{AJ}$ by performing a saturation step:

$$\mathcal{C}_{X,u} = \langle f, \det(\mathcal{AJ}) \rangle : \langle \partial f/\partial x_1, \partial f/\partial x_2 \rangle^{\infty}. \tag{2.3}$$

The ideal $\mathcal{C}_{X,u}$ lives in $\mathbb{R}[x_1, x_2]$. See [52, Section 4.4] for saturation
and other ideal operations. Frequently, the coefficients of $f$ and the coordinates of $u$ are
rational numbers, and we can perform the computation purely symbolically in $\mathbb{Q}[x_1, x_2]$.
The saturation step in (2.3) removes points that are singular on the curve $X = V(f)$. If $X$ is
smooth then saturation is unnecessary, and we simply have
$\mathcal{C}_{X,u} = \langle f, \det(\mathcal{AJ}) \rangle$. On the other hand, if $X$ has singular
points, then we must saturate. For a concrete example take the cardioid

$$f = (x_1^2 + x_2^2 + x_2)^2 - (x_1^2 + x_2^2), \tag{2.4}$$

and fix a random point $u = (u_1, u_2)$. The ideal $\langle f, \det(\mathcal{AJ}) \rangle$ is the
intersection of $\mathcal{C}_{X,u}$ and an $\langle x_1, x_2 \rangle$-primary ideal of multiplicity 3.
The critical ideal $\mathcal{C}_{X,u}$ has three distinct complex zeros. We can express their
coordinates in radicals in the given numbers $u_1, u_2$.
The following code computes the degree of the critical ideal $\mathcal{C}_{X,u}$ of the cardioid $X$
for the data point $u = (2, 1)$. The code is written for the software Macaulay2 [73].

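    R = QQ[x1, x2];
    u1 = 2; u2 = 1;
    f = (x1^2 + x2^2 + x2)^2 - (x1^2 + x2^2);
    -- augmented Jacobian: the row x - u atop the gradient of f
    AJ = matrix {{x1 - u1, x2 - u2}, {diff(x1, f), diff(x2, f)}};
    I = ideal {f, det AJ};
    -- saturate by the gradient ideal to remove the singular point of the cardioid
    C = saturate(I, ideal {diff(x1, f), diff(x2, f)});
    degree C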


Saturation of ideals is discussed in Chapter 3. See (3.1) for the definition of saturation. ⋄
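
For a smooth curve the critical equations can be solved without any saturation. The following
Python sketch is an illustration added here, not a computation from the text: it assumes the
parabola $x_2 = x_1^2$ as the model and the data point $u = (2, 1)$, and the helper names
`cubic_roots` and `critical_points_parabola` are hypothetical. Substituting $x_2 = x_1^2$ into
the determinant of the $2 \times 2$ augmented Jacobian matrix gives the cubic
$2x_1^3 + (1 - 2u_2)x_1 - u_1 = 0$, whose three complex roots are the critical points.

```python
import cmath


def cubic_roots(a, b, c, d):
    """Return the three complex roots of a*x^3 + b*x^2 + c*x + d (real coefficients, a != 0)."""
    b, c, d = b / a, c / a, d / a              # normalize to a monic cubic
    p = lambda x: ((x + b) * x + c) * x + d
    bound = 1.0 + max(abs(b), abs(c), abs(d))  # Cauchy bound: every root has modulus < bound
    lo, hi = -bound, bound                     # hence p(lo) < 0 < p(hi) for a monic cubic
    for _ in range(200):                       # bisection locates one real root
        mid = 0.5 * (lo + hi)
        if p(mid) < 0:
            lo = mid
        else:
            hi = mid
    r = 0.5 * (lo + hi)
    # Deflate: x^3 + b x^2 + c x + d = (x - r) (x^2 + B x + C)
    B = b + r
    C = c + B * r
    disc = cmath.sqrt(B * B - 4 * C)
    return [complex(r), (-B + disc) / 2, (-B - disc) / 2]


def critical_points_parabola(u1, u2):
    """Complex critical points of the ED problem on the parabola x2 = x1^2 for data (u1, u2)."""
    # det(AJ) = (x1 - u1) + 2*x1*(x2 - u2); substituting x2 = x1^2 gives a cubic in x1.
    roots = cubic_roots(2.0, 0.0, 1.0 - 2.0 * u2, -u1)
    return [(t, t * t) for t in roots]


points = critical_points_parabola(2.0, 1.0)
real_points = [(t.real, s.real) for (t, s) in points if abs(t.imag) < 1e-9]
print(len(points), "complex critical points,", len(real_points), "of them real")
for (x1, x2) in real_points:
    print((x1, x2), "squared distance:", (x1 - 2.0) ** 2 + (x2 - 1.0) ** 2)
```

Only the real critical points are candidates for the closest point, but all complex roots are
counted when we measure the algebraic complexity of the problem.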


The variety $V(\mathcal{C}_{X,u})$ is the set of complex critical points for the ED problem (2.1).
For a random data point $u$, this variety is a finite subset of $\mathbb{C}^n$, and it contains the
optimal solution $x^*$, provided the latter is attained at a smooth point of $X$. It was proved in
[60] that the number of critical points, i.e. the cardinality of the variety $V(\mathcal{C}_{X,u})$,
is independent of $u$, provided we assume that the data point $u$ is sufficiently general. See also
Example 3.20.

Definition 2.2 The Euclidean distance degree (ED degree) of the variety $X$ is the number
of complex points in the variety $V(\mathcal{C}_{X,u})$, for generic data points $u$. We write this as

$$\operatorname{EDdegree}(X) := \# V(\mathcal{C}_{X,u}).$$


In Example 2.1 we examined a quartic curve whose ED degree equals 3. The ED degree of 
a variety X measures the difficulty of solving the ED problem (2.1) using exact algebraic 
methods. The ED degree is an important complexity measure in metric algebraic geometry. 


Example 2.3 (Space curves) Fix $n = 3$ and let $X$ be the curve in $\mathbb{R}^3$ defined by two
general polynomials $f_1$ and $f_2$ of degrees $d_1$ and $d_2$ in three unknowns $x_1, x_2, x_3$.
The augmented Jacobian matrix has format $3 \times 3$, and we compute it as follows:

$$\mathcal{AJ} = \begin{pmatrix} x_1 - u_1 & x_2 - u_2 & x_3 - u_3 \\ \partial f_1/\partial x_1 & \partial f_1/\partial x_2 & \partial f_1/\partial x_3 \\ \partial f_2/\partial x_1 & \partial f_2/\partial x_2 & \partial f_2/\partial x_3 \end{pmatrix}. \tag{2.5}$$

Fix a general data vector $u \in \mathbb{R}^3$. Then the critical ideal equals
$\mathcal{C}_{X,u} = \langle f_1, f_2, \det(\mathcal{AJ}) \rangle$.
Hence, the set of critical points is the intersection of three surfaces. These surfaces have
degrees $d_1$, $d_2$ and $d_1 + d_2 - 1$. By Bézout's Theorem [133, Theorem 2.16], the expected
number of complex solutions to the critical equations is the product of these degrees.
Hence, the ED degree of the curve $X$ equals $d_1 d_2 (d_1 + d_2 - 1)$.

The same formula can be derived from a formula for general curves in terms of algebraic
geometry data. Let $X$ be a general smooth curve of degree $d$ and genus $g$ in any ambient
space $\mathbb{R}^n$. By [60, Corollary 5.9], we have $\operatorname{EDdegree}(X) = 3d + 2g - 2$.
Our curve in 3-space has degree $d = d_1 d_2$ and genus
$g = d_1^2 d_2/2 + d_1 d_2^2/2 - 2 d_1 d_2 + 1$. We conclude that

$$\operatorname{EDdegree}(X) = 3d + 2g - 2 = d_1 d_2 (d_1 + d_2 - 1).$$

This formula also covers the case of plane curves (cf. Example 2.1). Namely, if we set $d_1 = d$
and $d_2 = 1$ then we see that a general plane curve $X$ of degree $d$ has
$\operatorname{EDdegree}(X) = d^2$. In particular, a general plane quartic has ED degree 16.
However, that number can drop a lot for curves that are special. For the cardioid in (2.4) the
ED degree drops from 16 to 3. ⋄
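
The agreement between the Bézout count and the genus formula is a short piece of arithmetic
that can be checked mechanically. The Python sketch below is only an illustration; the
function names are ours, and the degree and genus expressions are the ones stated in
Example 2.3.

```python
from fractions import Fraction


def ed_degree_via_genus(d1, d2):
    """Evaluate 3d + 2g - 2 with d = d1*d2 and g = d1^2*d2/2 + d1*d2^2/2 - 2*d1*d2 + 1."""
    d = d1 * d2
    g = Fraction(d1 * d1 * d2, 2) + Fraction(d1 * d2 * d2, 2) - 2 * d1 * d2 + 1
    return 3 * d + 2 * g - 2


def ed_degree_via_bezout(d1, d2):
    """Product of the degrees d1, d2 and d1 + d2 - 1 of the three surfaces."""
    return d1 * d2 * (d1 + d2 - 1)


for d1 in range(1, 7):
    for d2 in range(1, 7):
        assert ed_degree_via_genus(d1, d2) == ed_degree_via_bezout(d1, d2)

print(ed_degree_via_bezout(4, 1))  # a general plane quartic
```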


Here is a general upper bound on the ED degree in terms of the given polynomials. 


Proposition 2.4 Let $X$ be a variety of codimension $c$ in $\mathbb{R}^n$ whose ideal $I_X$ is
generated by polynomials $f_1, f_2, \ldots, f_c, \ldots, f_k$ of degrees
$d_1 \geq d_2 \geq \cdots \geq d_c \geq \cdots \geq d_k$. Then

$$\operatorname{EDdegree}(X) \;\leq\; d_1 d_2 \cdots d_c \cdot \sum_{i_1 + i_2 + \cdots + i_c \,\leq\, n - c} (d_1 - 1)^{i_1} (d_2 - 1)^{i_2} \cdots (d_c - 1)^{i_c}. \tag{2.6}$$

Equality holds when $X$ is a generic complete intersection of codimension $c$ (hence $c = k$).


Proof This appears in [60, Proposition 2.6]. We can derive it as follows. Bézout's Theorem
ensures that the degree of the variety $X$ is at most $d_1 d_2 \cdots d_c$. The entries in the
$i$th row of the matrix $\mathcal{AJ}$ are polynomials of degree $d_{i-1} - 1$. The degree of the
variety of $(c+1) \times (c+1)$ minors of $\mathcal{AJ}$ is at most the sum in (2.6). This follows
from the Giambelli-Thom-Porteous formula, which expresses the degree of a determinantal variety
in terms of symmetric functions. The intersection of that determinantal variety with $X$ is
our set of critical points. The cardinality of that set is at most the product of the two degrees.
Generically, that intersection is a complete intersection and equality holds in (2.6). □
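
The right-hand side of (2.6) is easy to evaluate by brute force. The following Python sketch
does this; the function name `ed_degree_bound` is ours, and the input is assumed to be the list
of the $c$ largest generator degrees together with the number $n$ of variables.

```python
from itertools import product
from math import prod


def ed_degree_bound(degrees, n):
    """Right-hand side of (2.6): degrees = (d_1, ..., d_c), n = number of variables."""
    c = len(degrees)
    total = 0
    # Sum over all exponent vectors (i_1, ..., i_c) with i_1 + ... + i_c <= n - c.
    for exps in product(range(n - c + 1), repeat=c):
        if sum(exps) <= n - c:
            total += prod((d - 1) ** i for d, i in zip(degrees, exps))
    return prod(degrees) * total


print(ed_degree_bound((4,), 2))       # plane quartic
print(ed_degree_bound((3, 2, 2), 7))  # three polynomials in seven variables
```

For a plane curve of degree $d$ the sum has the two terms $1$ and $d - 1$, so the bound is $d^2$,
in agreement with Example 2.3.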


Formulas and bounds for the ED degree are important when studying exact solutions 
to the optimization problem (2.1). The paradigm is to compute all complex critical points, 
by either symbolic or numerical methods (cf. Chapter 3), and to then extract one’s favorite 
real solutions among these. This reveals all local minima in (2.1). The ED degree is an 
upper bound on the number of real critical points, but this bound is generally not tight. 


Fig. 2.1: ED problems on the Trott curve: configurations of eight (left) or ten (right) real critical points.
Data points are yellow, local minima are green, and local maxima are red. The coordinates of the critical
points are computed by solving the critical equations in (2.8).


Example 2.5 Consider the case $n = 2$, $c = 1$, $d_1 = 4$ in Proposition 2.4, where $X$ is a generic
quartic curve in the plane $\mathbb{R}^2$. The number of complex critical points is
$\operatorname{EDdegree}(X) = 16$. But they cannot all be real. For an illustration, consider the
Trott curve $X = V(f)$, given by

$$f = 144(x_1^4 + x_2^4) - 225(x_1^2 + x_2^2) + 350 x_1^2 x_2^2 + 81. \tag{2.7}$$

This curve is shown in Figure 2.1. Given data $u = (u_1, u_2)$ in $\mathbb{R}^2$, the critical equations are

$$(x_1 - u_1)\,\frac{\partial f}{\partial x_2} - (x_2 - u_2)\,\frac{\partial f}{\partial x_1} \;=\; f \;=\; 0. \tag{2.8}$$

Assuming $u$ to be a general point, these two quartic equations have 16 distinct complex
solutions, and these are all critical points in $X$. Since the Trott curve is smooth, the saturation
step in (2.3) is not needed when computing the ideal $\mathcal{C}_{X,u}$.

The ED degree 16 is an upper bound for the number of real critical points of the
optimization problem (2.1) on the Trott curve $X$ for any data point $u$. The actual number of
real critical points depends heavily on the specific location of $u$. For data $u$ near the origin,
eight of the 16 points in $V(\mathcal{C}_{X,u})$ are real. For u = (Z, wm) which is inside the rightmost
oval, there are 10 real critical points. The two scenarios are shown in Figure 2.1. Local
minima are green, while local maxima are red. Finally, consider u = (2, ww) which lies to
the right of the rightmost oval. Here, the number of real critical points is 12. ⋄


In general, our task is to compute the complex zeros of the critical ideal $\mathcal{C}_{X,u}$.
Algorithms for this computation can be either symbolic or numerical. Symbolic methods usually rest
on the construction of a Gröbner basis, to be followed by a floating point computation
to extract the solutions. In recent years, numerical methods have become popular. These
are based on homotopy continuation. We explain how homotopy continuation works in
Chapter 3. That chapter will be devoted to computations.


Example 2.6 Let us revisit the cardioid from Example 2.1. For $u = (2, 1)$ we compute the
three critical points using the Julia [20] software HomotopyContinuation.jl [31].

    using HomotopyContinuation, LinearAlgebra
    @var x1 x2
    u1 = 2; u2 = 1;
    f = (x1^2 + x2^2 + x2)^2 - (x1^2 + x2^2);
    # 2 x 2 matrix with columns x - u and the gradient of f
    AJ = [[x1 - u1; x2 - u2] differentiate(f, [x1; x2])];
    # solve f = det(AJ) = 0 by homotopy continuation
    solve([f; det(AJ)])
⋄


We next illustrate current capabilities. 


Example 2.7 Suppose $X$ is defined by $c = k = 3$ random polynomials in $n = 7$ variables,
of degrees $d_1, d_2, d_3$. The table below lists the ED degree in each case, and the times used
by HomotopyContinuation.jl to compute and certify all critical points in $\mathbb{C}^7$.


    d_1 d_2 d_3      322     332     333     422     432     433     442     443
    EDdegree        1188    3618    9477    4176   10152   23220   23392   49872
    Solve (sec)    3.849   21.06   61.51   31.51   103.5   280.0   351.5   859.3
    Certify (sec)  0.390   1.549   4.653   2.762   7.591   17.16   21.65   50.07


These ED degrees can be checked with Proposition 2.4. In our computation, the critical
ideal $\mathcal{C}_{X,u}$ represents a system of 10 equations in 10 variables. In addition to the
three equations $f_1 = f_2 = f_3 = 0$ in the 7 variables $x_1, \ldots, x_7$, we take the seven equations

$$(1, y_1, y_2, y_3) \cdot \mathcal{AJ} = 0.$$

In these equations, $y_1, y_2, y_3$ are new variables. The seven additional equations ensure that
the $4 \times 7$ matrix $\mathcal{AJ}$ has rank $\leq 3$. This formulation avoids the listing of all
$\binom{7}{4} = 35$ maximal minors. It is the preferred representation of determinantal varieties
in the setting of numerical algebraic geometry.

The timings above refer to computing all complex solutions to the system of 10 equations
in 10 variables. They include the certification step, as described in [30], which
proves correctness and completeness. These computations were performed with the software
HomotopyContinuation.jl [31] on a 16 GB MacBook Pro with an Intel Core i7
processor working at 2.6 GHz. They suggest that our critical equations can be solved fast
and reliably, with proof of correctness, when the ED degree is less than 50000. ⋄

We now introduce an alternative version of the ED degree by using a different metric.
We fix a weight vector $\Lambda = (\lambda_1, \ldots, \lambda_n) \in \mathbb{R}^n$ whose entries are
positive real numbers. The weighted Euclidean metric is


$$\|x\|_\Lambda^2 := \sum_{i=1}^{n} \lambda_i x_i^2.$$


If $\Lambda$ is the all-one vector, then we obtain the usual Euclidean metric. For arbitrary
$\Lambda$, we consider the analogue of the optimization problem (2.1):

$$\text{minimize } \|x - u\|_\Lambda^2 = \sum_{i=1}^{n} \lambda_i (x_i - u_i)^2 \quad \text{subject to } x = (x_1, \ldots, x_n) \in X. \tag{2.9}$$


Let $u \in \mathbb{R}^n$. We obtain the $\Lambda$-augmented Jacobian matrix $\mathcal{AJ}_\Lambda$,
of format $(k+1) \times n$, by placing the row vector
$(\lambda_1 (x_1 - u_1), \ldots, \lambda_n (x_n - u_n))$ atop the Jacobian matrix $\mathcal{J}$.
The critical ideal of the optimization problem (2.9) is then
$\mathcal{C}^\Lambda_{X,u} = I : J^\infty$, where $I$ is generated by the $(c+1) \times (c+1)$
minors of $\mathcal{AJ}_\Lambda$, together with the generators of $I_X$, and $J$ is the ideal
of $c \times c$ minors of $\mathcal{J}$. This leads to the following definition.


Definition 2.8 Let $X \subset \mathbb{R}^n$ be a real variety, let
$\Lambda = (\lambda_1, \ldots, \lambda_n)$ have positive entries, and let $u \in \mathbb{R}^n$ be a
general point. The $\Lambda$-weighted Euclidean distance degree of the variety $X$
is the number of points in the variety $V(\mathcal{C}^\Lambda_{X,u})$. For general weights
$\Lambda$, this number is independent of $\Lambda$, and we call it the generic Euclidean distance
degree of $X$, denoted

$$\operatorname{EDdegree}_{\mathrm{gen}}(X) := \# V(\mathcal{C}^\Lambda_{X,u}).$$


We will discuss the generic ED degree for a class of matrix problems in the next section. 


2.2 Low-Rank Matrix Approximation 


A specific instance of the ED problem (2.1) arises when considering varieties of matrices 
of low rank that are constrained to have a special structure. Sometimes these matrices are 
flattenings of tensors. This version of the problem was studied in the article [142], which 
focuses on Hankel matrices, Sylvester matrices and generic subspaces of matrices, and 
which uses a weighted version of the Euclidean metric. In this section, we offer a brief 
introduction to this special case of our general ED problem. Our point of departure is the 
following low-rank approximation problem for rectangular matrices: 


$$\text{minimize } \|A - U\|^2 = \sum_{i=1}^{m} \sum_{j=1}^{n} (a_{ij} - u_{ij})^2 \quad \text{subject to } \operatorname{rank}(A) \leq r. \tag{2.10}$$


In this problem, we are given a real data matrix $U = (u_{ij})$ of format $m \times n$, where
$m \leq n$, and we wish to find a real matrix $A = (a_{ij})$ of rank at most $r$ that is closest
to $U$. The Euclidean norm on the space of matrices is sometimes called the Frobenius norm. We can
write this as $\|A\|^2 = \operatorname{Trace}(A^T A)$. Indeed, if $a_1, \ldots, a_n$ denote the
columns of $A$, then the diagonal entries of $A^T A$ are $\langle a_i, a_i \rangle$ for
$i = 1, \ldots, n$. This implies

$$\operatorname{Trace}(A^T A) = \sum_{i=1}^{n} \langle a_i, a_i \rangle = \sum_{i=1}^{n} \sum_{j=1}^{m} a_{ji}^2 = \|A\|^2.$$

The solution to (2.10) is given by the singular value decomposition

$$U = T_1 \cdot \operatorname{diag}(\sigma_1, \sigma_2, \ldots, \sigma_m) \cdot T_2.$$

Here $T_1$ and $T_2$ are orthogonal matrices, and the real numbers in the diagonal matrix satisfy
$\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_m \geq 0$. These are the singular values of $U$.
The following well-known theorem from numerical linear algebra concerns the variety of
$m \times n$ matrices of rank $\leq r$.


Theorem 2.9 (Eckart-Young) The closest matrix of rank $\leq r$ to the given matrix $U$ equals

$$U^* = T_1 \cdot \operatorname{diag}(\sigma_1, \ldots, \sigma_r, 0, \ldots, 0) \cdot T_2. \tag{2.11}$$

It is the unique local minimum for generic $U$. All complex critical points are real. They are
found by substituting zeros for $m - r$ entries of $\operatorname{diag}(\sigma_1, \ldots, \sigma_m)$.
Thus, $\operatorname{EDdegree}(X) = \binom{m}{r}$.


We present a complete proof of the Eckart-Young Theorem in Section 9.2. 
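
The count $\binom{m}{r}$ can be seen by enumerating which singular values are kept. The
following Python sketch is an illustration under a simplifying assumption: the data matrix is
already diagonal, $U = \operatorname{diag}(\sigma_1, \ldots, \sigma_m)$ with distinct positive
entries, so that $T_1$ and $T_2$ are identity matrices. The function name is ours.

```python
from itertools import combinations
from math import comb


def eckart_young_critical_points(sigma, r):
    """For U = diag(sigma), return (kept indices, squared distance to U) for each critical point."""
    m = len(sigma)
    points = []
    for keep in combinations(range(m), r):
        # The critical point keeps sigma_i for i in `keep` and replaces the others by zero,
        # so its squared Frobenius distance to U is the sum of the discarded sigma_i^2.
        dist2 = sum(s * s for i, s in enumerate(sigma) if i not in keep)
        points.append((keep, dist2))
    return points


sigma = [5.0, 3.0, 2.0, 1.0]
points = eckart_young_critical_points(sigma, 2)
best = min(points, key=lambda p: p[1])
print(len(points), "critical points; closest keeps indices", best[0], "at squared distance", best[1])
```

The minimum is attained by keeping the $r$ largest singular values, as in (2.11).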
In structured low-rank approximation, we are given a linear subspace
$\mathcal{L} \subset \mathbb{R}^{m \times n}$, and a data matrix $U \in \mathcal{L}$, and we wish
to solve the following restricted optimization problem:

$$\text{minimize } \|A - U\|^2 = \sum_{i=1}^{m} \sum_{j=1}^{n} (a_{ij} - u_{ij})^2 \quad \text{subject to } A \in \mathcal{L} \text{ and } \operatorname{rank}(A) \leq r. \tag{2.12}$$


A best-case scenario would be the following: if $U$ lies in $\mathcal{L}$ then so does the SVD
solution $U^*$ in (2.11). This happens for some linear subspaces $\mathcal{L}$, including symmetric
and circulant matrices. However, most subspaces $\mathcal{L}$ of $\mathbb{R}^{m \times n}$ do not
enjoy this property, and finding the global optimum in (2.12) can be quite difficult. Our
discussion follows the article [142], which studies this optimization problem for both generic
and special subspaces $\mathcal{L}$.

We consider structured low-rank approximation using a weighted Euclidean metric. Our
primary task is to compute the number of complex critical points of (2.12). Thus, we seek
to find the Euclidean distance degree (ED degree) of the determinantal variety

$$\mathcal{L}_{\leq r} := \{ A \in \mathcal{L} \mid \operatorname{rank}(A) \leq r \} \subset \mathbb{R}^{m \times n}.$$

We use a $\Lambda$-weighted Euclidean distance coming from the ambient matrix space
$\mathbb{R}^{m \times n}$. We write $\operatorname{EDdegree}_\Lambda(\mathcal{L}_{\leq r})$ for the
$\Lambda$-weighted Euclidean distance degree of the variety $\mathcal{L}_{\leq r}$
(see Definition 2.8). The importance of the weights $\Lambda$ is highlighted in [60, Example 3.2],
for the seemingly harmless situation when $\mathcal{L}$ is the space of all symmetric matrices in
$\mathbb{R}^{n \times n}$.

Of special interest are the usual Euclidean distance degree, denoted by
$\operatorname{EDdegree}(\mathcal{L}_{\leq r})$, when $\Lambda = \mathbf{1}$ is the all-one matrix,
and the generic ED degree $\operatorname{EDdegree}_{\mathrm{gen}}(\mathcal{L}_{\leq r})$, when
the weight matrix $\Lambda$ is generic. The generic ED degree is given by a formula that rests
on intersection theory. See [60, Theorem 7.7] and Theorem 2.13 below. Indeed, choosing
the weights $\lambda_{ij}$ to be generic ensures that the projective closure of
$\mathcal{L}_{\leq r}$ has transversal intersection with the isotropic quadric
$\{ A \in \mathbb{P}^{mn-1} \mid \sum_{i=1}^{m} \sum_{j=1}^{n} \lambda_{ij} a_{ij}^2 = 0 \}$.

We next present illustrations for the concepts above. These can also serve as examples 
for Theorem 2.13 below, as seen by the Macaulay2 [73] calculation in Example 2.22. 


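Example 2.10 Let $m = n = 3$ and $\mathcal{L} \subset \mathbb{R}^{3 \times 3}$ the 5-dimensional
space of Hankel matrices:

$$A = \begin{pmatrix} a_0 & a_1 & a_2 \\ a_1 & a_2 & a_3 \\ a_2 & a_3 & a_4 \end{pmatrix}, \quad
U = \begin{pmatrix} u_0 & u_1 & u_2 \\ u_1 & u_2 & u_3 \\ u_2 & u_3 & u_4 \end{pmatrix}, \quad \text{and} \quad
\Lambda = \begin{pmatrix} \lambda_0 & \lambda_1 & \lambda_2 \\ \lambda_1 & \lambda_2 & \lambda_3 \\ \lambda_2 & \lambda_3 & \lambda_4 \end{pmatrix}.$$

Our goal is to solve the following constrained optimization problem for $r = 1$ and $r = 2$:

$$\text{minimize } \lambda_0 (a_0 - u_0)^2 + 2\lambda_1 (a_1 - u_1)^2 + 3\lambda_2 (a_2 - u_2)^2 + 2\lambda_3 (a_3 - u_3)^2 + \lambda_4 (a_4 - u_4)^2 \quad \text{subject to } \operatorname{rank}(A) \leq r.$$

This can be rephrased as an unconstrained optimization problem. For instance, for rank
$r = 1$, we get a one-to-one parametrization of $\mathcal{L}_{\leq 1}$ by setting $a_i = s t^i$
for $i = 0, 1, 2, 3, 4$. Our optimization problem is then as follows:

$$\text{minimize } \lambda_0 (s - u_0)^2 + 2\lambda_1 (st - u_1)^2 + 3\lambda_2 (st^2 - u_2)^2 + 2\lambda_3 (st^3 - u_3)^2 + \lambda_4 (st^4 - u_4)^2.$$

The ED degree is the number of critical points in this unconstrained minimization problem,
where we require $t \neq 0$. We consider this problem for three weight matrices:

$$\mathbf{1} = \begin{pmatrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{pmatrix}, \quad
\Omega = \begin{pmatrix} 1 & 1/2 & 1/3 \\ 1/2 & 1/3 & 1/2 \\ 1/3 & 1/2 & 1 \end{pmatrix}, \quad
\Theta = \begin{pmatrix} 1 & 2 & 2 \\ 2 & 2 & 2 \\ 2 & 2 & 1 \end{pmatrix}.$$

Here $\Omega$ gives the usual Euclidean metric when $\mathcal{L}$ is identified with $\mathbb{R}^5$.
The last weight matrix $\Theta$ arises from identifying $\mathcal{L}$ with the space of symmetric
$2 \times 2 \times 2 \times 2$-tensors; see Chapter 12 for a study of metrics in the space of tensors.
We compute

$$\operatorname{EDdegree}_{\mathbf{1}}(\mathcal{L}_{\leq 1}) = 6, \quad \operatorname{EDdegree}_{\Omega}(\mathcal{L}_{\leq 1}) = 10, \quad \operatorname{EDdegree}_{\Theta}(\mathcal{L}_{\leq 1}) = 4,$$
$$\operatorname{EDdegree}_{\mathbf{1}}(\mathcal{L}_{\leq 2}) = 9, \quad \operatorname{EDdegree}_{\Omega}(\mathcal{L}_{\leq 2}) = 13, \quad \operatorname{EDdegree}_{\Theta}(\mathcal{L}_{\leq 2}) = 7.$$

In both cases, $\Omega$ exhibits the generic behavior, so we have
$\operatorname{EDdegree}_{\mathrm{gen}}(\mathcal{L}_{\leq r}) = \operatorname{EDdegree}_{\Omega}(\mathcal{L}_{\leq r})$.
See [142, Sections 3 and 4] for larger Hankel matrices and formulas for their ED degrees.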
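
The coefficients in the objective function above come from counting how often each unknown
$a_k$ occurs in the Hankel matrix. The Python sketch below, with function names of our own
choosing, collects the weights along the antidiagonals and evaluates the rank-one objective
for the parametrization $a_k = s t^k$.

```python
from fractions import Fraction as F


def hankel_coefficients(weights):
    """Coefficient of (a_k - u_k)^2, for k = 0..4, in the weighted distance on 3x3 Hankel matrices."""
    coeffs = [0] * 5
    for i in range(3):
        for j in range(3):
            coeffs[i + j] += weights[i][j]   # entry (i, j) of a Hankel matrix equals a_{i+j}
    return coeffs


def rank_one_objective(s, t, u, weights):
    """Weighted squared distance from the Hankel matrix with a_k = s*t^k to the data u_0..u_4."""
    coeffs = hankel_coefficients(weights)
    return sum(c * (s * t ** k - u[k]) ** 2 for k, c in enumerate(coeffs))


ones = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
omega = [[F(1), F(1, 2), F(1, 3)], [F(1, 2), F(1, 3), F(1, 2)], [F(1, 3), F(1, 2), F(1)]]
theta = [[1, 2, 2], [2, 2, 2], [2, 2, 1]]

print(hankel_coefficients(ones))
print(hankel_coefficients(omega))
print(hankel_coefficients(theta))
```

For $\Omega$ all five coefficients equal one, which is the usual Euclidean metric on
$\mathbb{R}^5$, while $\Theta$ produces the binomial coefficients $1, 4, 6, 4, 1$.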


Remark 2.11 The distance given by the above weight matrix $\Theta$ yields the Bombieri-Weyl
distance. This plays an important role in our study of condition numbers in Chapter 9.

Example 2.12 Let $m = n = 3$, $r = 1$ and $\mathcal{L} = \mathbb{R}^{3 \times 3}$. Thus, we consider
the weighted rank-one approximation problem for $3 \times 3$-matrices. We know that
$\operatorname{EDdegree}_{\mathrm{gen}}(\mathcal{L}_{\leq 1}) = 39$; see [60, Example 7.10]. We take
a circulant data matrix and a circulant weight matrix:

$$U = \begin{pmatrix} -59 & 11 & 59 \\ 11 & 59 & -59 \\ 59 & -59 & 11 \end{pmatrix} \quad \text{and} \quad
\Lambda = \begin{pmatrix} 9 & 6 & 1 \\ 6 & 1 & 9 \\ 1 & 9 & 6 \end{pmatrix}.$$

This instance has 39 complex critical points. Of these, 19 are real, and 7 are local minima:

$$\begin{pmatrix} 0.0826 & 2.7921 & -1.5452 \\ 2.7921 & 94.3235 & -52.2007 \\ -1.5452 & -52.2007 & 28.8890 \end{pmatrix}, \quad
\begin{pmatrix} -52.2007 & 28.8890 & -1.5452 \\ 2.7921 & -1.5452 & 0.0826 \\ 94.3235 & -52.2007 & 2.7921 \end{pmatrix}, \quad
\begin{pmatrix} -52.2007 & 2.7921 & 94.3235 \\ 28.8890 & -1.5452 & -52.2007 \\ -1.5452 & 0.0826 & 2.7921 \end{pmatrix},$$

$$\begin{pmatrix} -29.8794 & 36.2165 & -27.2599 \\ -32.7508 & 39.6968 & -29.8794 \\ 39.6968 & -48.1160 & 36.2165 \end{pmatrix}, \quad
\begin{pmatrix} -48.1160 & 36.2165 & 39.6968 \\ 36.2165 & -27.2599 & -29.8794 \\ 39.6968 & -29.8794 & -32.7508 \end{pmatrix}, \quad
\begin{pmatrix} -29.8794 & -32.7508 & 39.6968 \\ 36.2165 & 39.6968 & -48.1160 \\ -27.2599 & -29.8794 & 36.2165 \end{pmatrix},$$

$$\begin{pmatrix} -25.375 & -25.375 & -25.375 \\ -25.375 & -25.375 & -25.375 \\ -25.375 & -25.375 & -25.375 \end{pmatrix}.$$

The first three are the global minima in our ED distance problem. The last matrix is the
local minimum where the objective function has the largest value: note that each entry
equals $-203/8$. The entries of the first six matrices are algebraic numbers of degree 10
over $\mathbb{Q}$. For instance, the two upper left entries 0.0826 and $-48.1160$ are among the four
real roots of the irreducible polynomial

$$\begin{aligned}
& 164466028468224\, x^{10} + 27858648335954688\, x^{9} + 1602205386689376672\, x^{8} \\
& + 7285836260028875412\, x^{7} - 2198728936046680414272\, x^{6} \\
& - 14854532690380098143152\, x^{5} + 2688673091228371095762316\, x^{4} \\
& + 44612094455115888622678587\, x^{3} - 41350080445712457319337106\, x^{2} \\
& + 27039129499043116889674775\, x - 1977632463563766878765625.
\end{aligned}$$

Here, the critical ideal in $\mathbb{Q}[x_{11}, x_{12}, \ldots, x_{33}]$ is not prime. A computation
in Macaulay2 reveals that it is the intersection of six maximal ideals. The degrees of these maximal
ideals over $\mathbb{Q}$ are 1, 2, 6, 10, 10, 10. The sum of these numbers equals
$39 = \operatorname{EDdegree}_{\mathrm{gen}}(\mathcal{L}_{\leq 1})$. ⋄


Explicit formulas are derived in [142, Section 3] for
$\operatorname{EDdegree}_{\mathrm{gen}}(\mathcal{L}_{\leq r})$ when $\mathcal{L}$ is a
generic subspace of $\mathbb{R}^{m \times n}$. This covers the four cases that arise by pairing
affine subspaces or linear subspaces with either unit weights or generic weights. One important
feature of determinantal varieties is that they are not complete intersections. This fact implies
that their ED degrees are much smaller than suggested by the upper bound in Proposition 2.4.

2.3 Invitation to Polar Degrees


We have introduced the ED degree of an algebraic variety $X$ as a complexity measure for
the ED problem in (2.1). The number 39 in the previous example served as an illustration
of how the ED degree controls the number of critical points. But a deeper understanding
is needed. In this section, we develop the algebro-geometric roots of the ED degree, which
will then yield more advanced algorithms for finding it. Here is a key result for this:


Theorem 2.13 If the given variety $X$ meets both the hyperplane at infinity and the isotropic
quadric transversally, then $\operatorname{EDdegree}(X)$ equals the sum of the polar degrees of the
projective closure of $X$.


This theorem appears in [60, Proposition 6.10]. The hypothesis of the theorem is satisfied
for all varieties $X$ in $\mathbb{R}^n$ after a general linear change of coordinates. This implies the
following result for $\operatorname{EDdegree}_{\mathrm{gen}}(X)$, which is the quantity introduced in
Definition 2.8.


Corollary 2.14 The generic ED degree of a variety is the sum of the polar degrees of its 
projective closure. 


We shall explain all the terms used in Theorem 2.13. First of all, the projective closure of
our affine real variety $X \subset \mathbb{C}^n$ is its Zariski closure in complex projective space
$\mathbb{P}^n$, which we also denote by $X$. Algebraically, $\mathbb{P}^n$ is obtained from
$\mathbb{C}^n$ by adding one homogenizing coordinate $x_0$. We identify the affine space
$\mathbb{C}^n$ with the open subset $\{x \in \mathbb{P}^n \mid x_0 \neq 0\}$.
Its set complement $\{x \in \mathbb{P}^n \mid x_0 = 0\} \simeq \mathbb{P}^{n-1}$ is the hyperplane
at infinity inside $\mathbb{P}^n$. The hypersurface
$\{x \in \mathbb{P}^{n-1} \mid \sum_{i=1}^{n} x_i^2 = 0\}$ is called the isotropic quadric. It lives
in the hyperplane at infinity and it has no real points. The hypothesis in Theorem 2.13 means that
the intersection of $X$ with these two loci is reduced and has the expected dimension.

If we are given a real projective variety $X$ in $\mathbb{P}^n$ from the start, then we also consider
the ED problem for its affine cone in $\mathbb{R}^{n+1}$. For this problem, the data vector has $n + 1$
coordinates, say $u = (u_0, u_1, \ldots, u_n)$. The augmented Jacobian $\mathcal{AJ}$ is now
redefined so as to respect the fact that all polynomials are homogeneous. The general formula for
this matrix and the homogeneous critical ideal appear in [60, Equation (2.7)].

For a curve $X \subset \mathbb{P}^2$ with defining polynomial $f(x_0, x_1, x_2)$, we use the
$3 \times 3$ matrix

$$\mathcal{AJ} = \begin{pmatrix} u_0 & u_1 & u_2 \\ x_0 & x_1 & x_2 \\ \partial f/\partial x_0 & \partial f/\partial x_1 & \partial f/\partial x_2 \end{pmatrix}.$$

The homogeneous critical ideal in $\mathbb{R}[x_0, x_1, x_2]$ is computed as follows:

$$\mathcal{C}_{X,u} = \langle f, \det(\mathcal{AJ}) \rangle : \bigl( \langle \partial f/\partial x_0, \partial f/\partial x_1, \partial f/\partial x_2 \rangle \cdot \langle x_1^2 + x_2^2 \rangle \bigr)^{\infty}. \tag{2.13}$$

The critical points are given by the variety $V(\mathcal{C}_{X,u})$ in the complex projective plane
$\mathbb{P}^2$. The cardinality of this variety equals $\operatorname{EDdegree}(X)$. The factor
$\langle x_1^2 + x_2^2 \rangle$ in the saturation step (2.13) is the isotropic quadric. It is needed
whenever the hypothesis of Theorem 2.13 is not satisfied. Namely, it removes any extraneous
component that may arise from non-transversal intersection of the projective curve $X$ with the
isotropic quadric.

Example 2.15 (Cardioid) We consider the homogeneous version of the cardioid that was
studied in Example 2.1. This is the curve $X = V(f)$ in $\mathbb{P}^2$ defined by

$$f = (x_1^2 + x_2^2 + x_0 x_2)^2 - x_0^2 (x_1^2 + x_2^2). \tag{2.14}$$

This curve has three singular points, namely that at the origin $V(x_1, x_2)$ in the affine plane
$\mathbb{C}^2 = \{x_0 \neq 0\}$ and the two points in the isotropic quadric $V(x_1^2 + x_2^2)$ in
$\mathbb{P}^1 = \{x_0 = 0\}$. The homogeneous critical ideal $\mathcal{C}_{X,u}$ is generated by three
cubics, and it defines seven points in $\mathbb{P}^2$. Hence the projective cardioid $X$ has
$\operatorname{EDdegree}(X) = 7$. This is also the ED degree of the affine cardioid in (2.4) but
only after a linear change of coordinates. We note that even a modest change of coordinates can
have a dramatic impact on the ED degree. For instance, if we replace $x_1$ by $2x_1$ in (2.4)
then the ED degree jumps from 3 to 7. ⋄


We next define the polar degrees of a projective variety $X \subset \mathbb{P}^n$. Other definitions
and many more details are given in Chapter 4. Recall that points in the dual projective space
$(\mathbb{P}^n)^*$ represent hyperplanes in the primal space $\mathbb{P}^n$. The Euclidean bilinear
form $\langle \cdot, \cdot \rangle$ is non-degenerate. Therefore, we can use it to identify
$\mathbb{P}^n$ and its dual space. To be precise, the point $h \in \mathbb{P}^n$ represents the
hyperplane $\{x \in \mathbb{P}^n \mid \langle h, x \rangle = h_0 x_0 + \cdots + h_n x_n = 0\}$.


Definition 2.16 The conormal variety $N_X \subset \mathbb{P}^n \times \mathbb{P}^n$ is the Zariski
closure of the set of all pairs $(x, h)$ of points in $\mathbb{P}^n \times \mathbb{P}^n$ such that
$x$ is a nonsingular point of $X$ and $h$ represents a hyperplane that is tangent to $X$ at $x$.


It is known that the conormal variety $N_X$ has dimension $n - 1$, and if $X$ is irreducible then
so is $N_X$. The image of $N_X$ under projection onto the second factor is the dual variety $X^\vee$;
see Section 4.2. If $X$ is a curve in the plane $\mathbb{P}^2$, then this yields the definition of
dual curve from (1.7). The role of $x$ and $h$ in the definition of the conormal variety can be
swapped. Theorem 4.11 from Chapter 4 implies that the following biduality relations hold:

$$N_X = N_{X^\vee} \quad \text{and} \quad (X^\vee)^\vee = X. \tag{2.15}$$

The conormal variety is an object of algebraic geometry that offers the theoretical foundations
for various aspects of duality in optimization, including primal-dual algorithms.


Example 2.17 For a plane curve $X = V(f)$ in $\mathbb{P}^2$, the conormal variety $N_X$ is a curve
in $\mathbb{P}^2 \times \mathbb{P}^2$. Its ideal is derived from the ideal that is generated by $f$
and the $2 \times 2$ minors of

$$\begin{pmatrix} h_0 & h_1 & h_2 \\ \partial f/\partial x_0 & \partial f/\partial x_1 & \partial f/\partial x_2 \end{pmatrix}.$$

By saturation, as in (2.13), we remove singularities and points on the isotropic quadric.
The result is the bihomogeneous prime ideal in $\mathbb{R}[x_0, x_1, x_2, h_0, h_1, h_2]$ which
defines $N_X$. The equation of the dual curve $X^\vee$ is obtained from this ideal by eliminating
$x_0, x_1, x_2$. For instance, if $X$ is the homogeneous cardioid in (2.14) then its dual $X^\vee$ is
the cubic curve defined by
$16 h_0^3 - 27 h_0 h_1^2 - 24 h_0^2 h_2 - 15 h_0 h_2^2 - 2 h_2^3$. The ideal of the conormal
curve $N_X$ has ten minimal generators. In addition to the above generators of bidegrees
$(4, 0)$ and $(0, 3)$, we find the quadric $x_0 h_0 + x_1 h_1 + x_2 h_2$ of bidegree $(1, 1)$, three
cubics of bidegree $(2, 1)$ like $x_1^2 h_1 - 3 x_2^2 h_1 - x_0 x_1 h_2 + 4 x_1 x_2 h_2$, and four
cubics of bidegree $(1, 2)$. ⋄

We now finally come to the polar degrees. The product of two projective spaces
$\mathbb{P}^n \times \mathbb{P}^n$ serves as the ambient space for our primal-dual approach to the ED
problem. We first consider its cohomology ring:

$$H^*(\mathbb{P}^n \times \mathbb{P}^n, \mathbb{Z}) = \mathbb{Z}[s, t] / \langle s^{n+1}, t^{n+1} \rangle.$$

The class of the conormal variety $N_X$ in this cohomology ring is a binary form of degree
$n + 1 = \operatorname{codim}(N_X)$ whose coefficients are nonnegative integers:

$$[N_X] = \delta_1(X)\, s^n t + \delta_2(X)\, s^{n-1} t^2 + \delta_3(X)\, s^{n-2} t^3 + \cdots + \delta_n(X)\, s t^n. \tag{2.16}$$

In Macaulay2 one uses the command multidegree to compute this binary form.

Definition 2.18 The coefficients $\delta_i(X)$ in (2.16) are called the polar degrees of $X$.


Remark 2.19 The polar degrees satisfy $\delta_i(X) = \#(N_X \cap (L \times L'))$, where
$L \subset \mathbb{P}^n$ and $L' \subset \mathbb{P}^n$ are general linear subspaces of dimensions
$n + 1 - i$ and $i$ respectively. We will see this in Chapter 4. This geometric interpretation
implies that $\delta_i(X) = 0$ for $i < \operatorname{codim}(X^\vee)$ and for $i > \dim(X) + 1$.
Moreover, the first and last polar degree are the classical degrees:

$$\delta_i(X) = \begin{cases} \operatorname{degree}(X) & \text{for } i = \dim(X) + 1, \\ \operatorname{degree}(X^\vee) & \text{for } i = \operatorname{codim}(X^\vee). \end{cases} \tag{2.17}$$


Example 2.20 Let $X \subset \mathbb{P}^2$ be the cardioid in (2.14). The ideal of the curve
$N_X \subset \mathbb{P}^2 \times \mathbb{P}^2$ was derived in Example 2.17. From this ideal we compute
the cohomology class

$$[N_X] = \operatorname{degree}(X^\vee) \cdot s^2 t + \operatorname{degree}(X) \cdot s t^2 = 3 s^2 t + 4 s t^2.$$

Thus the polar degrees of the cardioid are 3 and 4. Their sum 7 is the ED degree. ⋄


Example 2.21 Let $X$ be a general surface of degree $d$ in $\mathbb{P}^3$. Its dual $X^\vee$ is a
surface of degree $d(d-1)^2$ in $(\mathbb{P}^3)^*$. The conormal variety $N_X$ is a surface in
$\mathbb{P}^3 \times (\mathbb{P}^3)^*$, with class

$$[N_X] = d(d-1)^2\, s^3 t + d(d-1)\, s^2 t^2 + d\, s t^3.$$

The sum of the three polar degrees is $\operatorname{EDdegree}(X) = d^3 - d^2 + d$; see Proposition 2.4. ⋄
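
The sum of polar degrees in this example is a one-line identity, and it can be confirmed for
many values of the degree at once. The short Python sketch below does so; the function name
is ours, and the three coefficients are the ones displayed in the class of the conormal variety.

```python
def polar_degrees_general_surface(d):
    """Polar degrees of a general surface of degree d in projective 3-space."""
    return [d * (d - 1) ** 2, d * (d - 1), d]


for d in range(1, 11):
    # The sum of the polar degrees should agree with d^3 - d^2 + d.
    assert sum(polar_degrees_general_surface(d)) == d ** 3 - d ** 2 + d

print(polar_degrees_general_surface(3), sum(polar_degrees_general_surface(3)))
```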


Theorem 2.13 allows us to compute the ED degree for many interesting varieties, 
e.g. using Chern classes [60, Theorem 5.8]. This is relevant for many applications, including 
those in machine learning, our topic in Chapter 10. Frequently, these applications involve 
low-rank approximation of matrices and tensors with special structure [33, 142]. 


Example 2.22 (Determinantal varieties) Let $X = X_r \subset \mathbb{P}^{m^2-1}$ be the projective
variety of $m \times m$ matrices $x = (x_{ij})$ of rank $\leq r$. We claim that its conormal variety
$N_X$ is cut out by matrix equations (here, the product symbol $\cdot$ denotes the multiplication
of matrices):

$$N_X = \{ (x, h) \in \mathbb{P}^{m^2-1} \times \mathbb{P}^{m^2-1} \mid \operatorname{rank}(x) \leq r,\ \operatorname{rank}(h) \leq m - r,\ x \cdot h = 0 \text{ and } h \cdot x = 0 \}.$$

This follows from Lemma 9.12 in Chapter 9. In particular, among determinantal varieties
the duality relation $(X_r)^\vee = X_{m-r}$ holds. We now type the above formula for $N_X$ into
Macaulay2 [73], for $r = 1$ and $m = 3$. With this, we compute the polar degrees as follows:
    QQ[x11,x12,x13,x21,x22,x23,x31,x32,x33,
       h11,h12,h13,h21,h22,h23,h31,h32,h33,
       Degrees => {{1,0},{1,0},{1,0},{1,0},{1,0},{1,0},{1,0},{1,0},{1,0},
                   {0,1},{0,1},{0,1},{0,1},{0,1},{0,1},{0,1},{0,1},{0,1}}];
    x = matrix {{x11,x12,x13},{x21,x22,x23},{x31,x32,x33}};
    h = matrix {{h11,h12,h13},{h21,h22,h23},{h31,h32,h33}};
    -- rank(x) <= 1, rank(h) <= 2, and x*h = h*x = 0
    I = minors(2,x) + minors(3,h) + minors(1,x*h) + minors(1,h*x);
    isPrime(I), codim(I), degree I
    multidegree(I)


The code starts with the bigraded coordinate ring of $\mathbb{P}^8 \times \mathbb{P}^8$. It verifies
that $N_X$ has codimension 9 and that I is its prime ideal. The last command computes the polar degrees:

$$[N_X] = 3 s^8 t + 6 s^7 t^2 + 12 s^6 t^3 + 12 s^5 t^4 + 6 s^4 t^5. \tag{2.18}$$

At this point, the reader may verify (2.17). Using Corollary 2.14, we conclude

$$\operatorname{EDdegree}_{\mathrm{gen}}(X_1) = 3 + 6 + 12 + 12 + 6 = 39.$$

Indeed, after changing coordinates, the ED degree for $3 \times 3$-matrices of rank 1 equals 39. We
saw this in Example 2.12, where 39 critical points were found by numerical computation.


The primal-dual set-up of conormal varieties allows for a very elegant formulation of
the critical equations for the ED problem (2.1). This will be presented in the next theorem.
We now assume that $X$ is an irreducible variety defined by homogeneous polynomials in
$n$ variables. We view $X$ as an affine cone in $\mathbb{C}^n$. Its dual $Y = X^\vee$ is the affine
cone over the dual of the projective variety given by $X$. Thus $Y$ is also an affine cone in
$\mathbb{C}^n$. In this setting, the conormal variety $N_X$ is viewed as an affine variety of
dimension $n$ in $\mathbb{C}^{2n}$. The homogeneous ideals of these cones are precisely the ideals
we discussed above.


Theorem 2.23 The ED problems for $X$ and $Y$ coincide: $\operatorname{EDdegree}(X) = \operatorname{EDdegree}(Y)$.
Given a general data point $u \in \mathbb{R}^n$, the critical equations for this ED problem are:

$$(x, h) \in N_X \quad \text{and} \quad x + h = u. \tag{2.19}$$


Proof See [60, Theorem 5.2]. □


It is instructive to verify Theorem 2.23 for the low-rank variety in Example 2.22. For
any data matrix $u$ of size $m \times m$, the sum in (2.19) is a special decomposition of $u$, namely
as a matrix $x$ of rank $r$ plus a matrix $h$ of rank $m - r$. By the Eckart-Young Theorem 2.9,
this arises from zeroing out complementary singular values $\sigma_i$ in the two matrices $x$ and $h$.

In general, there is no free lunch, even with a simple formulation like (2.19). The
difficulty lies in computing the ideal of the conormal variety $N_X$. However, this should be
thought of as a preprocessing step, to be carried out only once per model X. If an efficient
presentation of $N_X$ is available, then our task is to solve the system $x + h = u$ of n linear
equations in the 2n coordinates on $N_X$, now viewed as an n-dimensional affine variety.

The discussion in this section was restricted to the Euclidean norm. However, we can measure
distances in $\mathbb{R}^n$ with any other norm $\|\cdot\|$. Our optimization problem (2.1) extends naturally:

$$\text{minimize } \|x - u\| \text{ subject to } x \in X. \qquad (2.20)$$

The unit ball $B = \{x \in \mathbb{R}^n \mid \|x\| \le 1\}$ in the chosen norm is a centrally symmetric convex
body. Conversely, every centrally symmetric convex body B in $\mathbb{R}^n$ defines a norm, and we
can paraphrase the previous optimization problem as follows:

$$\text{minimize } \lambda \text{ subject to } \lambda \ge 0 \text{ and } (u + \lambda B) \cap X \neq \emptyset.$$


We will meet minimization problems for unit balls that are polyhedra in Chapter 5. 

If the boundary of the unit ball B is smooth and algebraic then we can express the critical
equations for the corresponding norm as a polynomial system. This is derived as before,
but we now replace the first row of the augmented Jacobian matrix AJ with the gradient
vector of the map $\mathbb{R}^n \to \mathbb{R},\ x \mapsto \|x - u\|^2$. In conclusion, this chapter was dedicated to
computing the minimal distance from a data point to a given variety for the Euclidean norm.
The algebraic machinery we developed can be applied to much more general scenarios.
For instance, Kubjas, Kuznetsova and Sodomaco [114] study this topic for p-norms.

In this chapter, we study two computational approaches to solve a system of polynomial
equations. A system of m polynomial equations in n variables can be written in the form

$$F(x) := \begin{pmatrix} f_1(x) \\ \vdots \\ f_m(x) \end{pmatrix} = 0, \quad \text{where} \quad f_1, \ldots, f_m \in \mathbb{C}[x] = \mathbb{C}[x_1, \ldots, x_n].$$


If n = m, we call F(x) a square system. If n > m, then we call F(x) underdetermined. If 
n < m, we call F(x) overdetermined. Here, we will mostly focus on square systems. 


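Example 3.1 The previous chapter considered constrained optimization problems of the
form $\min_{x \in \mathbb{R}^n :\, g(x) = 0} f(x)$, where f and g are polynomials in n variables $x = (x_1, \ldots, x_n)$.
To solve this problem, one can compute the solutions of the critical equations

$$g(x) \;=\; \frac{\partial f}{\partial x_1}(x) + \lambda \cdot \frac{\partial g}{\partial x_1}(x) \;=\; \cdots \;=\; \frac{\partial f}{\partial x_n}(x) + \lambda \cdot \frac{\partial g}{\partial x_n}(x) \;=\; 0.$$

This is a square system in the n + 1 variables $(x, \lambda)$, where $\lambda$ is a Lagrange multiplier. ⋄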
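
To make the critical equations concrete, here is a minimal sketch in plain Python (chosen only for illustration; it is not the software used in this book). It assumes the hypothetical instance $f = (x_1 - u_1)^2 + (x_2 - u_2)^2$ and $g = x_1^2 + x_2^2 - 1$, for which the square system in $(x_1, x_2, \lambda)$ can be solved in closed form, and it checks that the residuals of the three equations vanish.

```python
import math

def critical_points_on_circle(u1, u2):
    """Closed-form solutions of g = 0, df/dxi + lam * dg/dxi = 0 (i = 1, 2)
    for f = (x-u1)^2 + (y-u2)^2 and g = x^2 + y^2 - 1, assuming u != 0."""
    norm_u = math.hypot(u1, u2)
    points = []
    for sign in (+1.0, -1.0):
        lam = -1.0 + sign * norm_u           # Lagrange multiplier
        x, y = u1 / (1.0 + lam), u2 / (1.0 + lam)
        points.append((x, y, lam))
    return points

def residuals(x, y, lam, u1, u2):
    """Left-hand sides of the n + 1 = 3 critical equations."""
    return (
        x * x + y * y - 1.0,                 # g(x) = 0
        2.0 * (x - u1) + lam * 2.0 * x,      # df/dx1 + lam * dg/dx1
        2.0 * (y - u2) + lam * 2.0 * y,      # df/dx2 + lam * dg/dx2
    )

u = (3.0, 4.0)                               # hypothetical data point
for (x, y, lam) in critical_points_on_circle(*u):
    print((x, y, lam), residuals(x, y, lam, *u))
```

The two critical points are the nearest and the farthest point of the circle from u.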


Solving the system $F(x) = 0$ means that we compute all points $z = (z_1, \ldots, z_n) \in \mathbb{C}^n$
such that $F(z) = 0$. The first step is to find an appropriate data structure to represent a
solution. In fact, $F(z) = 0$ is already implicitly represented by its equation. Some information
can be read off from this representation. For instance, if F has rational coefficients and we
know that $F(z) = 0$ has only finitely many complex solutions, then each coordinate $z_i$ of z
is an algebraic number. On the other hand, further information, like whether or not there is
a real solution $z \in \mathbb{R}^n$, is not directly accessible from this implicit representation.

Our aim in this chapter is to introduce two data structures for representing solutions of
systems of polynomial equations: the first is Gröbner bases and the second is approximate
zeros. Readers who are familiar with these concepts can skip ahead to the next chapter.

Remark 3.2 In Chapter 2 we formulated critical equations for optimization problems like
the ED problem (2.1). In this chapter, we present methods for solving these equations.
The field of polynomial optimization rests on an alternative approach: one employs
relaxations based on moments and sums of squares, and these are solved using semidefinite
programming. We refer to the book [143] for an introduction to polynomial optimization.


3.1 Gröbner Bases 


We use the notation $x^\alpha := x_1^{\alpha_1} \cdots x_n^{\alpha_n}$ for the monomial defined by the exponent vector
$\alpha = (\alpha_1, \ldots, \alpha_n) \in \mathbb{N}^n$. We can identify monomials with their exponent vectors. A
monomial order > on $\mathbb{C}[x]$ is then defined by a total order > on $\mathbb{N}^n$ that satisfies (1) if $\alpha > \beta$,
then $\alpha + \gamma > \beta + \gamma$ for every $\gamma \in \mathbb{N}^n$ and (2) every non-empty subset of $\mathbb{N}^n$ has a smallest
element under > (see, e.g., [52, Chapter 2 §2, Definition 1]).


Example 3.3 The Lex (Lexicographic) order on the set of monomials in n variables is
defined by setting $\alpha > \beta$ if $\alpha_j - \beta_j > 0$ for $\alpha, \beta \in \mathbb{N}^n$, where $j := \min\{i \mid \alpha_i \neq \beta_i\}$ is the
first index where $\alpha$ and $\beta$ are not equal. For instance, $x_1 x_2^2 > x_2^3 x_3^4$. Intuitively speaking,
the Lex order views a polynomial $f \in \mathbb{C}[x]$ as a polynomial in $x_1$ whose coefficients are
polynomials in $x_2$, whose coefficients in turn are polynomials in $x_3$, and so on. ⋄


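A monomial order induces the notion of leading term. Let $f = c_\alpha x^\alpha + c_\beta x^\beta + \cdots + c_\gamma x^\gamma$,
where $\alpha > \beta > \cdots > \gamma$ and $c_\alpha \neq 0$. Then, the leading term of f is $\mathrm{LT}(f) := c_\alpha x^\alpha$. The
leading term ideal of an ideal I in $\mathbb{C}[x]$ is defined as

$$\mathrm{LT}(I) := \langle \{\mathrm{LT}(f) \mid f \in I \setminus \{0\}\} \rangle.$$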
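
As a small illustration of the identification of monomials with exponent vectors, the following Python sketch computes leading terms in the Lex order. The dictionary representation of a polynomial and the function name `leading_term` are assumptions made for this sketch only. It relies on the fact that Python compares tuples lexicographically, which agrees with the Lex order on exponent vectors.

```python
def leading_term(poly):
    """poly maps exponent tuples (a1, ..., an) to nonzero coefficients.
    Python compares tuples lexicographically, which is the Lex order
    with x1 > x2 > ... > xn on exponent vectors."""
    alpha = max(poly)             # largest exponent vector in Lex order
    return alpha, poly[alpha]

# f = x^2 + y^2 + z^2 - 1 and g = x*y^2 + y^3*z^4, variables ordered (x, y, z)
f = {(2, 0, 0): 1, (0, 2, 0): 1, (0, 0, 2): 1, (0, 0, 0): -1}
g = {(1, 2, 0): 1, (0, 3, 4): 1}
print(leading_term(f))   # ((2, 0, 0), 1)
print(leading_term(g))   # ((1, 2, 0), 1)
```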


Definition 3.4 (Gröbner basis) Fix an ideal $I \subset \mathbb{C}[x]$ and a monomial order >. A finite
subset $\{g_1, \ldots, g_s\} \subset I$ is a Gröbner basis for I with respect to > if its leading terms
generate the leading term ideal:

$$\langle \mathrm{LT}(g_1), \ldots, \mathrm{LT}(g_s) \rangle = \mathrm{LT}(I).$$

If G is a Gröbner basis for an ideal I, then G generates I; i.e., $I = \langle G \rangle$. This justifies the
name “basis”, here chosen to mean “generating set”. (See [52, Chapter 2 §5, Corollary 6].)

Our next example is identical to that in [52, Chapter 2 §8, Example 2]. It illustrates how
Gröbner bases can be used to solve systems of polynomial equations.


Example 3.5 Consider the following system of three equations in three variables:

$$F(x,y,z) = \begin{pmatrix} x^2 + y^2 + z^2 - 1 \\ x^2 + z^2 - y \\ x - z \end{pmatrix} = 0.$$

We compute a Gröbner basis of the ideal $I = \langle x^2 + y^2 + z^2 - 1,\ x^2 + z^2 - y,\ x - z \rangle$ with
respect to the Lex order with x > y > z using Macaulay2 [73]. The code is as follows:

R = QQ[x, y, z, MonomialOrder => Lex];
f = x^2 + y^2 + z^2 - 1;
g = x^2 + z^2 - y;
h = x - z;
I = ideal {f, g, h};
G = gb I;
gens G


This computes the Gröbner basis $G = \{x - z,\ y - 2z^2,\ 4z^4 + 2z^2 - 1\}$. Since $I = \langle G \rangle$, we can
solve $F(x, y, z) = 0$ by solving the equations given by G. Notice that the third polynomial
in G only depends on z, the second only on y and z, and the first only on x and z. Thus,
solving $F(x, y, z) = 0$ reduces to solving three univariate equations. This gives the four
solutions $(a, 2a^2, a)$, where a is one of the four roots of $4z^4 + 2z^2 - 1$. ⋄
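
The back-substitution described in Example 3.5 can be sketched numerically as follows. This is an illustrative Python fragment, not part of the Macaulay2 session above. It assumes the Gröbner basis $\{x - z,\ y - 2z^2,\ 4z^4 + 2z^2 - 1\}$ as given, solves the last polynomial as a quadratic in $z^2$, and then substitutes upward.

```python
import cmath

# Solve the univariate equation 4 z^4 + 2 z^2 - 1 = 0 (a quadratic in w = z^2).
disc = cmath.sqrt(2 * 2 + 4 * 4 * 1)      # sqrt(b^2 - 4ac) with a = 4, b = 2, c = -1
w_roots = [(-2 + disc) / 8, (-2 - disc) / 8]
z_roots = [s * cmath.sqrt(w) for w in w_roots for s in (1, -1)]

# Back-substitute into y - 2 z^2 = 0 and x - z = 0.
solutions = [(z, 2 * z * z, z) for z in z_roots]

def F(x, y, z):
    """The original system from Example 3.5."""
    return (x * x + y * y + z * z - 1, x * x + z * z - y, x - z)

for sol in solutions:
    print(sol, max(abs(v) for v in F(*sol)))   # residuals are at rounding level
```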


The reason why using the Lex order in Example 3.5 works well is the Elimination
Theorem. To state this, consider any ideal $I \subset \mathbb{C}[x]$. For every $0 \le j < n$, the intersection
$I_j := I \cap \mathbb{C}[x_{j+1}, \ldots, x_n]$ is an ideal in a polynomial subring. It consists of those
polynomials in I that only contain the variables $x_{j+1}, \ldots, x_n$. We call $I_j$ the j-th elimination ideal
of I. For a proof of the next theorem see [52, Chapter 3, §1, Theorem 1].


Theorem 3.6 (The Elimination Theorem) Let $I \subset \mathbb{C}[x]$ be an ideal and let G be a Gröbner
basis for I with respect to the Lex order with $x_1 > \cdots > x_n$. Then $G \cap \mathbb{C}[x_{j+1}, \ldots, x_n]$
is a Gröbner basis of the j-th elimination ideal of I.


For us, the most important consequence of the Elimination Theorem is that, if a system
of polynomial equations $F(x) = (f_1(x), \ldots, f_m(x)) = 0$ has finitely many solutions, then
the j-th elimination ideal will not be the zero ideal for $0 \le j < n$. Consequently, we can solve
$F(x) = 0$ by computing a Gröbner basis for the Lex order and then sequentially solving
univariate equations. We can compute zeros of univariate polynomials by computing
eigenvalues of the associated companion matrix. If $f(x) = x^d + \sum_{i=0}^{d-1} c_i x^i$ is a univariate
polynomial, we have $f(\lambda) = 0$ if and only if $\lambda$ is an eigenvalue of the companion matrix

$$\begin{pmatrix} 0 & \cdots & 0 & -c_0 \\ 1 & \cdots & 0 & -c_1 \\ \vdots & \ddots & \vdots & \vdots \\ 0 & \cdots & 1 & -c_{d-1} \end{pmatrix} \in \mathbb{C}^{d \times d}.$$

Sometimes we are not interested in the solutions of $F(x) = 0$ per se, but only in the
number of solutions. Gröbner bases naturally carry this information: Suppose $I \subset \mathbb{C}[x]$ is
an ideal and > is a monomial order. Recall that a monomial $x^\alpha \notin \mathrm{LT}(I)$ is called a standard
monomial of I relative to >. The next result shows how to get the number of solutions
of $F(x) = 0$ from a Gröbner basis; see [164, Proposition 2.1].


Proposition 3.7 Let $I \subset \mathbb{C}[x]$ be an ideal, > a monomial order, and $\mathcal{B}$ the set of standard
monomials of I relative to >. Then, $\mathcal{B}$ is finite if and only if V(I) is finite, and $\#\mathcal{B}$ equals
the number of points in V(I) counting multiplicities.


Example 3.8 In Example 3.5, there are four standard monomials, namely, $1, z, z^2$ and $z^3$.
That is why the system $F(x, y, z) = 0$ has four solutions. We can view Proposition 3.7
as an n-dimensional version of the Fundamental Theorem of Algebra, which says that a
univariate polynomial of degree d has d zeros. ⋄
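
The count in Example 3.8 can be reproduced by brute force. The following Python sketch lists all exponent vectors that are not divisible by any of the leading monomials $x$, $y$, $z^4$ of the Gröbner basis from Example 3.5. The search bound is an assumption of this sketch: it must exceed every exponent occurring in a standard monomial, which is the case here.

```python
from itertools import product

def standard_monomials(leading_exponents, bound):
    """Exponent vectors in [0, bound)^n not divisible by any leading monomial."""
    n = len(leading_exponents[0])

    def divisible(a, b):              # does x^b divide x^a ?
        return all(ai >= bi for ai, bi in zip(a, b))

    return [a for a in product(range(bound), repeat=n)
            if not any(divisible(a, b) for b in leading_exponents)]

# Leading monomials x, y, z^4 of G = {x - z, y - 2z^2, 4z^4 + 2z^2 - 1}
lead = [(1, 0, 0), (0, 1, 0), (0, 0, 4)]
print(standard_monomials(lead, bound=6))
# [(0, 0, 0), (0, 0, 1), (0, 0, 2), (0, 0, 3)]
```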

We have now understood how to solve systems of polynomials with finitely many
zeros using Gröbner bases. What about systems whose variety has positive dimensional
components? In that case, the n-th elimination ideal is necessarily the zero ideal. To cope
with that case, one can remove positive dimensional components using ideal saturation.
Let $I, J \subset \mathbb{C}[x]$ be two ideals. The saturation of I by J is the ideal

$$I : J^\infty := \{f \in \mathbb{C}[x] \mid \text{there is } \ell > 0 \text{ with } f \cdot g^\ell \in I \text{ for all } g \in J\}. \qquad (3.1)$$

Saturation is the ideal analogue of removing components on the level of varieties. We have

$$V(I : J^\infty) = \overline{V(I) \setminus V(J)}; \qquad (3.2)$$

see [52, Chapter 4 §4, Corollary 11].
Recall that a solution to a square system $F(x) = (f_1(x), \ldots, f_n(x)) = 0$ is called regular
if the Jacobian determinant $\det\left(\frac{\partial f_i}{\partial x_j}\right)_{1 \le i,j \le n}$ does not vanish at that solution. There can
only be finitely many regular zeros of a square system of polynomial equations, and a
finite set of points is Zariski closed. Consequently, if we saturate $I = \langle f_1, \ldots, f_n \rangle$ by the
ideal $J = \left\langle \det\left(\frac{\partial f_i}{\partial x_j}\right) \right\rangle$, then we can use the strategy from above to solve $F(x) = 0$.

Example 3.9 Consider the following system of two polynomials in two variables:

$$F(x,y) = \begin{pmatrix} (x-1)(x-2)(x^2 + y^2 - 1) \\ (y-1)(y-3)(x^2 + y^2 - 1) \end{pmatrix} = 0.$$

This system has four regular solutions, namely (1, 1), (1, 3), (2, 1), (2, 3). In addition, the
system has the circle $x^2 + y^2 = 1$ as a positive dimensional component. We use Macaulay2
[73] to saturate the ideal generated by F. This is done as follows:


R = QQ[x, y, MonomialOrder => Lex];
f = (x-1) * (x-2) * (x^2+y^2-1);
g = (y-1) * (y-3) * (x^2+y^2-1);
I = ideal {f, g};
Jac = matrix{{diff(x, f), diff(x, g)}, {diff(y, f), diff(y, g)}};
J = ideal det(Jac);
K = saturate(I,J)


This code returns the ideal $K = \langle y^2 - 4y + 3,\ x^2 - 3x + 2 \rangle$. The two generators form a
Gröbner basis for K, with leading terms $y^2$ and $x^2$. ⋄


Next, we present two propositions related to elimination and saturation of ideals with
parameters, and one lemma on Gröbner bases of parametric ideals. For this, let
$x = (x_1, \ldots, x_n)$ and $p = (p_1, \ldots, p_k)$ be two sets of variables, and

$$\mathbb{C}[x, p] := \mathbb{C}[x_1, \ldots, x_n, p_1, \ldots, p_k].$$

We regard the coordinates $p_i$ of p as parameters. For a fixed parameter vector $q \in \mathbb{C}^k$, we
consider the surjective ring homomorphism

$$\phi_q : \mathbb{C}[x, p] \to \mathbb{C}[x], \quad f(x; p) \mapsto f(x; q). \qquad (3.3)$$

Proposition 3.10 Consider an ideal $I \subset \mathbb{C}[x, p]$ and let $G = \{g_1, \ldots, g_s\}$ be a Gröbner
basis for I relative to the Lex order $x_1 > \cdots > x_n > p_1 > \cdots > p_k$. For $1 \le i \le s$ with
$g_i \notin \mathbb{C}[p]$, write $g_i$ in the form $g_i = c_i(p)\, x^{\alpha_i} + h_i$, where all terms of $h_i$ are strictly smaller
than $x^{\alpha_i}$ in Lex order. Let $q \in V(I \cap \mathbb{C}[p]) \subseteq \mathbb{C}^k$ such that $c_i(q) \neq 0$ for all $g_i \notin \mathbb{C}[p]$.
Then, the following set is a Gröbner basis for the ideal $\phi_q(I) \subset \mathbb{C}[x]$:

$$\phi_q(G) = \{\phi_q(g_i) \mid g_i \notin \mathbb{C}[p]\}.$$

Proof See, e.g., [52, Chapter 4 §7, Theorem 2]. □


Example 3.11 (Cardioid revisited) As an illustration, fix n = k = 2 and let I be the
ideal generated by the cubic (2.4) and the determinant of (2.2), with $u_1, u_2$ replaced by
$p_1, p_2$. The lexicographic Gröbner basis G for this ideal has 15 elements. The 15 leading
coefficients $c_i(p)$ are quite complicated. Three of them are


(4p; +4p3 +4p2 + 1)’, 
32p5r (3p2 + 1)(3p2 +2)? (2p2 + 1)? (4p3 + Sp2 +2)? (1 + pp)’, 
P1P3r (8p) + 8p} p3 — 10p{ p- 18p3 — Tp} -21p} — 8p2 - 1) (pi + p +1) (pi - m - 1)’, 


where $r = 4p_2 + 3$. Suppose that we replace the unknowns $p_1, p_2$ by any complex numbers
$q_1, q_2$ such that $c_i(q_1, q_2) \neq 0$ for all i. Then the specialization $\phi_q(G)$ remains a Gröbner
basis with the same leading terms. In particular, the number of zeros $(x_1, x_2)$, which is six
when counted with multiplicities, is independent of the choice of parameters $q_1, q_2$. ⋄


In many applications, polynomial systems come with a non-degeneracy constraint. These
are usually expressed in the form of a polynomial inequation $h \neq 0$. We now show how to
incorporate such a constraint when solving a polynomial system.


Proposition 3.12 Let $I \subset \mathbb{C}[x, p]$ be an ideal and let $J = \langle h \rangle$ be a principal ideal in the
same polynomial ring. Let u be an additional variable and $K := \langle 1 - u \cdot h \rangle$. Then,

$$I : J^\infty = (I + K) \cap \mathbb{C}[x, p].$$

If G is a Gröbner basis of $I + K$ relative to the Lex order $u > x_1 > \cdots > x_n > p_1 > \cdots > p_k$,
then $G \cap \mathbb{C}[x, p]$ is a Gröbner basis of the saturation ideal $I : J^\infty$.


Proof See e.g. [52, Chapter 4 §4, Theorem 14]. □


Example 3.13 Fix I from Example 3.11. We take $K = \langle 1 - u \cdot h \rangle$ where $h = x_1 x_2$, since
the cardioid (2.4) is singular at the origin. The Gröbner basis G has 27 elements, but
with friendlier leading coefficients: PiP3(P1 + po+1)(p1 - p2- 1), p3(p2 + 1)?, P1P> 
p3(9p+ + ps + 2p3 = 9p? + p2), p3(p2 + 1), . . . Nonvanishing of these coefficients ensures 
that the system has 3 complex solutions, with multiplicities. © 


We now apply the technique above to identifying a discriminant $\Delta$ for a polynomial
system that has parameters. The following result is proved in [23].

Lemma 3.14 Let $I \subset \mathbb{C}[x, p]$ be an ideal and $J = \langle h \rangle \subset \mathbb{C}[x, p]$ be a principal ideal such
that $(I : J^\infty) \cap \mathbb{C}[p] = \{0\}$. Let $\{g_1, \ldots, g_s\}$ be a Gröbner basis of $I : J^\infty$ with respect to
the Lex order $x_1 > \cdots > x_n > p_1 > \cdots > p_k$. There is a proper subvariety $\Delta \subsetneq \mathbb{C}^k$ such
that, for all $q \notin \Delta$, the set $\{\phi_q(g_1), \ldots, \phi_q(g_s)\}$ is a Gröbner basis for $\phi_q(I) : \phi_q(J)^\infty$ and
none of the leading terms of $g_1, \ldots, g_s$ vanish when evaluated at q. In particular, we have

$$\phi_q(I : J^\infty) = \phi_q(I) : \phi_q(J)^\infty \quad \text{for all } q \notin \Delta.$$


Proof Let u be an additional variable. As in Proposition 3.12, we consider the ideal
$K := \langle 1 - u \cdot h \rangle$. Then, we have $I : J^\infty = (I + K) \cap \mathbb{C}[x, p]$. Using our hypothesis
that $(I : J^\infty) \cap \mathbb{C}[p] = \{0\}$, we conclude

$$V((I + K) \cap \mathbb{C}[p]) = \mathbb{C}^k. \qquad (3.4)$$

We may therefore apply Proposition 3.10 to $I + K$ without any restrictions on q. As
in Proposition 3.12, we augment the Lex order by letting u be the largest variable. Let
$\mathcal{G} := \{g_1, \ldots, g_r\}$ be a Gröbner basis of $I + K$ relative to this order. By (3.4), we have
that $g_1, \ldots, g_r \notin \mathbb{C}[p]$. We write each $g_i$ in the form $g_i = c_i(p)\, u^{\beta_i} x^{\alpha_i} + h_i$, where all terms
of $h_i$ are strictly smaller than $u^{\beta_i} x^{\alpha_i}$, and define the hypersurface

$$\Delta := \{q \in \mathbb{C}^k \mid c_1(q) \cdots c_r(q) = 0\}. \qquad (3.5)$$

Now let $q \in \mathbb{C}^k \setminus \Delta$. By Proposition 3.10, the set $\phi_q(\mathcal{G}) = \{\phi_q(g_1), \ldots, \phi_q(g_r)\}$ is a
Gröbner basis for the ideal

$$\phi_q(I + K) = \phi_q(I) + \phi_q(K) = \phi_q(I) + \langle 1 - u \cdot \phi_q(h) \rangle.$$

Without loss of generality, suppose the first $s \le r$ elements in $\mathcal{G}$ do not depend on the
variable u. We define $G := \{g_1, \ldots, g_s\} = \mathcal{G} \cap \mathbb{C}[x, p]$. It follows from Proposition 3.12
that G is a Gröbner basis of $I : J^\infty$. Because $q \notin \Delta$, none of the leading terms in $\mathcal{G}$ when
evaluated at q vanish. Consequently, we have $\phi_q(G) \cap \mathbb{C}[x] = \phi_q(\mathcal{G}) \cap \mathbb{C}[x]$. Therefore,
$\phi_q(G) = \{\phi_q(g_1), \ldots, \phi_q(g_s)\}$ is a Gröbner basis of $\phi_q(I) : \phi_q(J)^\infty$, by Proposition 3.12. □


We refer to $\Delta$ as the discriminant of the pair (I, h). Equation (3.5) leads to the following:


Corollary 3.15 Fix ideals $I = \langle f_1, \ldots, f_n \rangle \subset \mathbb{C}[x, p]$ and $J = \langle h \rangle \subset \mathbb{C}[x, p]$. The
discriminant $\Delta$ in Lemma 3.14 is found by computing a Lex Gröbner basis G for $I + \langle 1 - u \cdot h \rangle$,
where $u > x_1 > \cdots > x_n > p_1 > \cdots > p_k$. Namely, $\Delta$ is the zero set of the product of the leading
coefficients $c_i(p)$ in G.


Example 3.16 (Hyperdeterminant) Let k = 8, n = 2 and consider a general pair of
bilinear equations. Their ideal is $I = \langle f_1, f_2 \rangle$, where

$$f_1 = p_1 x_1 x_2 + p_2 x_1 + p_3 x_2 + p_4 \quad \text{and} \quad f_2 = p_5 x_1 x_2 + p_6 x_1 + p_7 x_2 + p_8.$$

Our non-degeneracy constraint is $h = \partial f_1/\partial x_1 \cdot \partial f_2/\partial x_2 - \partial f_1/\partial x_2 \cdot \partial f_2/\partial x_1$. This is the
Jacobian determinant of $f_1$ and $f_2$. The Gröbner basis G in Corollary 3.15 has 11 elements.
The most interesting Gröbner basis element has leading coefficient

$$\begin{aligned} c(p) = {} & p_1^2 p_8^2 - 2p_1p_2p_7p_8 - 2p_1p_3p_6p_8 - 2p_1p_4p_5p_8 + 4p_1p_4p_6p_7 + p_2^2 p_7^2 \\ & + 4p_2p_3p_5p_8 - 2p_2p_3p_6p_7 - 2p_2p_4p_5p_7 + p_3^2 p_6^2 - 2p_3p_4p_5p_6 + p_4^2 p_5^2. \end{aligned}$$

This coefficient is the main factor in our discriminant $\Delta$. This quartic is the hyperdeterminant
of the $2 \times 2 \times 2$ tensor with entries $p_1, p_2, \ldots, p_8$. ⋄
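
A quick sanity check of the quartic in Example 3.16 is possible without any Gröbner basis computation. The sketch below, in plain Python, uses the classical description of the $2 \times 2 \times 2$ hyperdeterminant as the discriminant of the binary quadratic form $\det(s M_1 + t M_2)$. The arrangement of $p_1, \ldots, p_8$ into the two slices $M_1$, $M_2$ is an assumption of this sketch, read off from the coefficients of $f_1$ and $f_2$. The two expressions are compared on random integer points.

```python
import random

def c(p):
    """The quartic leading coefficient from Example 3.16; p = (p1, ..., p8)."""
    p1, p2, p3, p4, p5, p6, p7, p8 = p
    return (p1**2 * p8**2 - 2*p1*p2*p7*p8 - 2*p1*p3*p6*p8 - 2*p1*p4*p5*p8
            + 4*p1*p4*p6*p7 + p2**2 * p7**2 + 4*p2*p3*p5*p8 - 2*p2*p3*p6*p7
            - 2*p2*p4*p5*p7 + p3**2 * p6**2 - 2*p3*p4*p5*p6 + p4**2 * p5**2)

def pencil_discriminant(p):
    """Discriminant B^2 - 4AC of det(s*M1 + t*M2) = A s^2 + B s t + C t^2,
    where M1 = [[p1, p2], [p3, p4]] and M2 = [[p5, p6], [p7, p8]]."""
    p1, p2, p3, p4, p5, p6, p7, p8 = p
    A = p1*p4 - p2*p3
    B = p1*p8 + p4*p5 - p2*p7 - p3*p6
    C = p5*p8 - p6*p7
    return B*B - 4*A*C

random.seed(0)
for _ in range(1000):
    p = [random.randint(-9, 9) for _ in range(8)]
    assert c(p) == pencil_discriminant(p)
print("c(p) agrees with the pencil discriminant on 1000 random integer points")
```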


3.2 The Parameter Continuation Theorem 


The theorem to be proved in this section states that a square system of polynomial equations 
with parameters has a well-defined degree. This degree is the number of complex solutions 
for generic parameter choices. This result is the Parameter Continuation Theorem, due 
to Morgan and Sommese [135]. We shall present a proof that rests on the results on 
Gröbner bases in the previous section. For this, we consider again the polynomial ring
$\mathbb{C}[x, p] = \mathbb{C}[x_1, \ldots, x_n, p_1, \ldots, p_k]$. We interpret x as variables and p as parameters.


Definition 3.17 Let $f_1(x; p), \ldots, f_n(x; p) \in \mathbb{C}[x, p]$. We consider the image of the map

$$\mathbb{C}^k \to \mathbb{C}[x]^n, \quad p_0 \mapsto F(x; p_0) = \begin{pmatrix} f_1(x; p_0) \\ \vdots \\ f_n(x; p_0) \end{pmatrix}.$$

This image is denoted $\mathcal{F} = \{F(x; p) \mid p \in \mathbb{C}^k\}$. Note that this is a family of square
polynomial systems of size n. In other words, the family $\mathcal{F}$ consists of n polynomials in n
variables that depend polynomially on k parameters.


In Examples 3.11, 3.13 and 3.16, we studied families of polynomial systems in n = 2
variables, where the number k of parameters was 2, 2 and 8, respectively. The degrees of
these families are 6, 3 and 2. These degrees count the numbers of complex zeros for generic
parameters p, which satisfy $c_i(p) \neq 0$ for all i.

We now fix a family $\mathcal{F}$ depending on k parameters $p = (p_1, \ldots, p_k)$. Let $z \in \mathbb{C}^n$ be a
zero of $F(x; p) = (f_1(x; p), \ldots, f_n(x; p)) \in \mathcal{F}$, for some specific parameters $p \in \mathbb{C}^k$. We
say that z is a regular zero if the Jacobian determinant $\det\left(\frac{\partial f_i}{\partial x_j}\right)_{1 \le i,j \le n}$ does not vanish at
the point z. The next theorem is the main result of this section. It shows that, for almost all
parameters p, the number of regular solutions is the same.


Theorem 3.18 (The Parameter Continuation Theorem) Let $\mathcal{F}$ be a family of polynomial
systems that consists of systems of n polynomials in n variables depending on k parameters.
For $p \in \mathbb{C}^k$, denote by N(p) the number of regular zeros of F(x; p), and $N := \sup_{p \in \mathbb{C}^k} N(p)$.
Then, $N < \infty$, and there exists a proper algebraic subvariety $\Delta \subsetneq \mathbb{C}^k$, called discriminant
of the system $\mathcal{F}$, such that N(p) = N for all $p \notin \Delta$.

Proof We recall the proof from [23]. Another proof can also be found in the textbook [161].
Suppose $\mathcal{F} = \{F(x; p) \mid p \in \mathbb{C}^k\}$, where $F(x; p) = (f_1(x; p), \ldots, f_n(x; p)) \in \mathcal{F}$. Let

$$I = \langle f_1, \ldots, f_n \rangle \quad \text{and} \quad J := \left\langle \det\left(\frac{\partial f_i}{\partial x_j}\right) \right\rangle.$$

If N = 0, then no system in $\mathcal{F}$ has regular zeros. In this case, the statement is true. We
now assume N > 0. By (3.2), the variety $V(I : J^\infty)$ consists of all pairs $(x, q) \in \mathbb{C}^n \times \mathbb{C}^k$
such that x is a regular zero of F(x; q). Since N > 0, we therefore have $V(I : J^\infty) \neq \emptyset$.
Let $(x, q) \in V(I : J^\infty)$. The Implicit Function Theorem ensures that there is a Euclidean
open neighborhood U of q such that F(x; q') has regular zeros for all $q' \in U$. Consequently,

$$(I : J^\infty) \cap \mathbb{C}[p] = \{0\},$$

so we can apply Lemma 3.14 in our situation.

Set $I_q = \phi_q(I)$ and $J_q = \phi_q(J)$. Let $G = \{g_1, \ldots, g_s\}$ be a Gröbner basis of $I : J^\infty$ for
the Lex order $x_1 > \cdots > x_n > p_1 > \cdots > p_k$. By Lemma 3.14, there is a proper algebraic
subvariety $\Delta \subsetneq \mathbb{C}^k$ such that, for all $q \notin \Delta$, the set $\phi_q(G) = \{\phi_q(g_1), \ldots, \phi_q(g_s)\}$ is a Gröbner basis for $I_q : J_q^\infty$
and none of the leading terms of $g_1, \ldots, g_s$ vanish when evaluated at q. This implies that
the leading monomials of $I_q : J_q^\infty$ are constant on $\mathbb{C}^k \setminus \Delta$.

We consider the set of standard monomials, i.e. monomials not in the lexicographic
initial ideal. We denote this set by

$$\mathcal{B}_q := \{\text{standard monomials of } I_q : J_q^\infty\}.$$

We have shown that $\mathcal{B}_q$ is constant on $\mathbb{C}^k \setminus \Delta$. On the other hand, by (3.2) and since finite
sets of points are Zariski closed, we have $V(I_q : J_q^\infty) = V(I_q) \setminus V(J_q)$. Proposition 3.7 and
the fact that regular zeros have multiplicity one imply that the following holds for all
parameters q that are not in the discriminant $\Delta$: $N(q) = \#\mathcal{B}_q$. This shows that N(q) is
constant on $\mathbb{C}^k \setminus \Delta$. The Implicit Function Theorem implies that, for all $q \in \mathbb{C}^k$, there exists a
Euclidean neighborhood U of q such that $N(q) \le N(q')$ for all $q' \in U$. Since $\Delta$ is a proper
subvariety of $\mathbb{C}^k$ and thus lower-dimensional, we have $N = N(q) < \infty$ for $q \in \mathbb{C}^k \setminus \Delta$. □


We can use the algorithm in Corollary 3.15 to compute the discriminant. However, this 
does not necessarily yield the smallest hypersurface with the properties in Theorem 3.18. 

Nevertheless, the algorithm in Corollary 3.15 also returns the discriminant when 
F(x;p) = 0 has nonregular solutions for all parameters p. Resultant-based methods for 
computing the discriminant would fail in such cases because the resultant will be constant 
and equal to zero. Here is a simple example to illustrate this phenomenon. 


Example 3.19 We slightly modify the system from Example 3.9 and consider

$$F(x, y; a) = \begin{pmatrix} (x-1)(x-2)(x^2 + y^2 - 1) \\ (y-1)(y-a)(x^2 + y^2 - 1) \end{pmatrix} = 0,$$

where $a \in \mathbb{C}$ is a parameter. If $a \notin \{0, 1, \pm\sqrt{-3}\}$, then we have N = 4 regular solutions.
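
The dependence of N(a) on the parameter can be explored directly, since every zero of F off the circle has $x \in \{1, 2\}$ and $y \in \{1, a\}$, while zeros on the circle are never regular. The following Python sketch counts the candidates with nonvanishing Jacobian determinant. The function name and the tolerance are assumptions made for this illustration.

```python
def num_regular_zeros(a, tol=1e-9):
    """Count regular zeros of F(x, y; a) among the candidates off the circle."""
    def jac_det(x, y):
        c = x * x + y * y - 1
        A, B = (x - 1) * (x - 2), (y - 1) * (y - a)
        fx, fy = (2 * x - 3) * c + A * 2 * x, A * 2 * y              # gradient of f = A*c
        gx, gy = B * 2 * x, (2 * y - 1 - a) * c + B * 2 * y          # gradient of g = B*c
        return fx * gy - fy * gx
    candidates = {(x, y) for x in (1, 2) for y in (1, a)}   # a set merges repeats
    return sum(1 for (x, y) in candidates if abs(jac_det(x, y)) > tol)

print(num_regular_zeros(3))               # generic parameter: 4
print(num_regular_zeros(0))               # (1, 0) lies on the circle: 3
print(num_regular_zeros(1))               # double root in y: 0
print(num_regular_zeros(3 ** 0.5 * 1j))   # (2, a) lies on the circle: 3
```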

Let us compute the discriminant of the polynomial system F(x, y; a) using the algorithm
in Corollary 3.15. We use the Macaulay2 code from [23]:


R = QQ[u, x, y, a, MonomialOrder => Lex];
f = (x-1) * (x-2) * (x^2+y^2-1);
g = (y-1) * (y-a) * (x^2+y^2-1);
I = ideal {f, g};
Jac = matrix{{diff(x, f), diff(x, g)}, {diff(y, f), diff(y, g)}};
K = ideal {1 - u * det(Jac)};
G = gens gb (I+K);
E = (entries(G))#0;
P = QQ[a][u, x, y, MonomialOrder => Lex];
result = apply(E, t -> leadCoefficient(sub(t, P)));
factor(product result)


The result is the polynomial 3538944 - a* - (a — 1)* - (a? +3)?. © 


Example 3.20 The critical equations for the ED problem in (2.1) have parameters u. In
many situations, the critical equations form a square system, and Theorem 3.18 applies.
The degree N is the Euclidean distance degree. For instance, in Example 2.3, we have
a square system in n = 3 variables with k = 3 parameters, and the ED degree equals
$N = d_1 d_2 (d_1 + d_2 - 1)$. What is the discriminant $\Delta$ in this case? ⋄


Example 3.21 (Tact invariant) We consider two general quadratic equations in n = 2
variables. Each equation has six coefficients, so there are k = 12 parameters in total:


R = QQ[x,y,a20,a11,a02,a10,a01,a00,b20,b11,b02,b10,b01,b00];
f = a20*x^2 + a11*x*y + a02*y^2 + a01*x + a10*y + a00;
g = b20*x^2 + b11*x*y + b02*y^2 + b01*x + b10*y + b00;


For general parameters q, the set of lexicographic standard monomials is $\mathcal{B}_q = \{1, y, y^2, y^3\}$.
Therefore, the number of solutions to our equations is N = 4. The main factor in the
discriminant $\Delta$ is a polynomial in the 12 coefficients that is known as the tact invariant.
This polynomial can be computed with the following Macaulay2 code:


I = ideal(f, g, diff(x,f)*diff(y,g) - diff(y,f)*diff(x,g));
tact = eliminate({x,y}, I)


The tact invariant has degree 12 and is the sum of 3210 monomials. See also [156, §96]. ⋄


3.3 Polynomial Homotopy Continuation 


In Section 3.1, we have seen how to use Gröbner bases to reduce solving F(x) = 0 to
the problem of sequentially computing zeros of univariate polynomials. Another approach 
is polynomial homotopy continuation (PHC). This is a numerical method for computing 
the regular zeros of a square system of polynomial equations. The textbook of Sommese 
and Wampler [161] provides a detailed introduction to the theory of polynomial homotopy 
continuation. We also refer to the overview article [15]. This subject area is known as 
numerical algebraic geometry, and the present section offers a lightning introduction. 


The goal in polynomial homotopy continuation is to compute approximate zeros. The
following definition goes back to Smale (see [21, §8, Definition 1]). It rests on Newton’s
method from numerical analysis.


Definition 3.22 (Approximate Zeros) Let F(x) be a square system of polynomial
equations in n variables, and write JF(x) for its (square) Jacobian matrix. A point $z \in \mathbb{C}^n$ is
called an approximate zero of F if the sequence of Newton iterates

$$z_{k+1} := z_k - JF(z_k)^{-1} F(z_k)$$

starting at $z_0 := z$ converges to a zero of F.


An approximate zero z of a polynomial system F is in a precise sense close to an actual 
zero x. Applying the Newton operator to z, we can get as close to x as we want. We can 
approximate x to any desired accuracy. 
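
To see the Newton iterates in action, here is a minimal Python sketch for the system of Example 3.5. It starts from rounded coordinates of one solution, which serve as a candidate approximate zero. The 3 x 3 linear solves use Cramer's rule, which is adequate only for such a tiny illustration, and the function names are hypothetical.

```python
def F(v):
    x, y, z = v
    return [x * x + y * y + z * z - 1, x * x + z * z - y, x - z]

def JF(v):
    """Jacobian matrix of F at the point v."""
    x, y, z = v
    return [[2 * x, 2 * y, 2 * z], [2 * x, -1, 2 * z], [1, 0, -1]]

def solve3(A, b):
    """Solve the 3x3 linear system A w = b by Cramer's rule."""
    def det(M):
        return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
                - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
                + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))
    d = det(A)
    cols = []
    for j in range(3):
        Aj = [[b[i] if k == j else A[i][k] for k in range(3)] for i in range(3)]
        cols.append(det(Aj) / d)
    return cols

def newton(z, steps=8):
    """Apply the Newton operator z -> z - JF(z)^(-1) F(z) a fixed number of times."""
    for _ in range(steps):
        step = solve3(JF(z), F(z))
        z = [zi - si for zi, si in zip(z, step)]
    return z

z = newton([0.556, 0.618, 0.556])      # rounded values, used as starting point
print(z, max(abs(v) for v in F(z)))
```

A handful of iterations already brings the residual down to rounding level, which reflects the quadratic convergence of Newton's method near a regular zero.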

Let F(x) = 0 be the system of polynomial equations that we want to solve. The idea in
homotopy continuation is to find a family $\mathcal{F}$ and parameters $p, q \in \mathbb{C}^k$ with the properties
that F(x) = F(x; p) and G(x) := F(x; q) is a system whose solutions are known or can
be computed by other means. For a piecewise smooth path $\gamma(t)$ in $\mathbb{C}^k$ with $\gamma(1) = q$ and
$\gamma(0) = p$, we define the parameter homotopy

$$H(x, t) := F(x; \gamma(t)),$$

and we track the solutions of F(x; q) = 0 to F(x; p) = 0 along the homotopy H. This tracking
involves an ordinary differential equation (ODE). Namely, we use numerical algorithms to
solve the ODE initial value problem

$$\left(\frac{\partial}{\partial x} H(x, t)\right) \dot{x} + \frac{\partial}{\partial t} H(x, t) = 0, \qquad x(1) = z. \qquad (3.6)$$

Here the initial value z is a zero of G(x). In this setting, G(x) = F(x; q) is called start
system and F(x) = F(x; p) is called target system. The output of the numerical solver is
then an approximate zero of F(x). In implementations, one often uses piecewise linear
paths, such as that described below.
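
The tracking procedure can be sketched in a few lines for a single variable. The Python fragment below is a toy predictor-corrector tracker, not the algorithm of any particular package. It assumes the hypothetical target $F(x) = x^2 - 5x + 6$, the start system $G(x) = x^2 - 1$, and a straight-line homotopy $H(x,t) = (1-t)F(x) + \gamma_0 t\, G(x)$ with a fixed non-real constant $\gamma_0$. Each step takes an Euler step for the ODE (3.6) and then corrects with Newton's method.

```python
GAMMA = 0.6 + 0.8j            # hypothetical non-real constant gamma_0

def H(x, t):                  # H(x, 1) = GAMMA * G(x),  H(x, 0) = F(x)
    return (1 - t) * (x * x - 5 * x + 6) + t * GAMMA * (x * x - 1)

def Hx(x, t):                 # partial derivative of H with respect to x
    return (1 - t) * (2 * x - 5) + t * GAMMA * 2 * x

def Ht(x, t):                 # partial derivative of H with respect to t
    return -(x * x - 5 * x + 6) + GAMMA * (x * x - 1)

def track(x, steps=200, newton_steps=3):
    """Follow one solution path of H(x, t) = 0 from t = 1 down to t = 0."""
    dt = -1.0 / steps
    for k in range(steps):
        t = 1.0 + k * dt
        x = x + dt * (-Ht(x, t) / Hx(x, t))     # Euler step for Hx * x' + Ht = 0
        for _ in range(newton_steps):           # Newton corrector at time t + dt
            x = x - H(x, t + dt) / Hx(x, t + dt)
    return x

ends = [track(complex(start)) for start in (1.0, -1.0)]   # start at the zeros of G
print(ends)
```

The two paths end at the zeros 2 and 3 of the target polynomial. For this homotopy the discriminant in x vanishes only if $\gamma_0$ is real, which is why a non-real constant is used.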


Remark 3.23 The left factor $\left(\frac{\partial}{\partial x} H(x, t)\right)$ in (3.6) is the $n \times n$ Jacobian matrix. Throughout
the tracking process, it is essential that this matrix is invertible. This means geometrically
that our path must stay away from the discriminant $\Delta$ of the polynomial system. This is
possible because $\Delta$ is a proper subvariety of the parameter space $\mathbb{C}^k$. The dimension of $\Delta$
over the real numbers $\mathbb{R}$ is an even number that is less than the real dimension 2k of the
ambient space $\mathbb{R}^{2k} = \mathbb{C}^k$. This ensures that the space $\mathbb{C}^k \setminus \Delta$ is connected.


There are several software packages for solving polynomial systems that are based on
homotopy continuation. In this book we use the software HomotopyContinuation.jl [31].


Example 3.24 We solve the system of polynomial equations from Example 3.5. Recall that
those equations are $x^2 + y^2 + z^2 - 1 = x^2 + z^2 - y = x - z = 0$.
The following are commands in the programming language Julia [20] on which the
package HomotopyContinuation.jl is based:


using HomotopyContinuation
@var x y z;
f = x^2 + y^2 + z^2 - 1;
g = x^2 + z^2 - y;
h = x - z;
F = System([f; g; h], variables = [x; y; z]);
solve(F)


This code returns the four solutions (here displayed with only the 4 most significant digits):

$$\begin{array}{l} (0.556 + 0.0\sqrt{-1},\ \ 0.618 - 0.0\sqrt{-1},\ \ 0.556 + 0.0\sqrt{-1}), \\ (-0.0 - 0.899\sqrt{-1},\ \ -1.618 + 0.0\sqrt{-1},\ \ -0.0 - 0.899\sqrt{-1}), \\ (-0.556 - 0.0\sqrt{-1},\ \ 0.618 + 0.0\sqrt{-1},\ \ -0.556 + 0.0\sqrt{-1}), \\ (0.0 + 0.899\sqrt{-1},\ \ -1.618 + 0.0\sqrt{-1},\ \ -0.0 + 0.899\sqrt{-1}). \end{array}$$

These are numerical approximations of the solutions found in Example 3.5. ⋄


Remark 3.25 The capabilities of HomotopyContinuation.jl were on display in Example
2.7: we solved the critical equations for the ED problem on some complete intersections.


Remark 3.26 Numerical computations are not exact and can therefore produce errors.
This also holds for polynomial homotopy continuation (PHC). It
is possible, though, to certify the output of PHC. Certification means that we obtain a
computer proof that we have indeed computed an approximate zero. There are various
certification methods. Current implementations are [30, 83, 122].


The next proposition explains why polynomial homotopy continuation works and when 
the initial value problem from (3.6) is well-posed. Its proof relies crucially on the Parameter 
Continuation Theorem (Theorem 3.18), and on the connectedness result in Remark 3.23. 


Proposition 3.27 Let $\mathcal{F}$ be a family of polynomial systems with k parameters. Let N > 2
and $\Delta$ be as in Theorem 3.18. Given $q \in \mathbb{C}^k \setminus \Delta$ and $p \in \mathbb{C}^k$, for almost all choices of
$p_{\mathrm{mid}} \in \mathbb{C}^k$, the piecewise linear path

$$\gamma(t) := \begin{cases} (2t - 1)\, q + 2(1 - t)\, p_{\mathrm{mid}}, & \text{if } \tfrac12 < t \le 1, \\ 2t\, p_{\mathrm{mid}} + (1 - 2t)\, p, & \text{if } 0 \le t \le \tfrac12, \end{cases}$$

satisfies:

(a) $\gamma((0, 1]) \cap \Delta = \emptyset$.

(b) The homotopy $H(x, t) := F(x; \gamma(t))$ defines N smooth curves x(t) with the property
that $H(x(t), t) = 0$ for $0 < t \le 1$. These curves are called solution paths.

(c) As $t \to 0$, the limits of the solution paths include all regular solutions of F(x; p) = 0.

(d) If moreover $\gamma(0) \notin \Delta$, then every solution path x(t) converges for $t \to 0$ to a regular
zero of F(x; p).
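
The piecewise linear path from Proposition 3.27 is easy to write down in code. The following Python sketch evaluates $\gamma(t)$ for parameter vectors given as lists of complex numbers. The concrete vectors p, q and the midpoint are hypothetical values chosen for illustration. In practice the midpoint would be drawn at random, so that with probability one the path avoids the discriminant.

```python
def gamma(t, p, q, p_mid):
    """Piecewise linear path with gamma(1) = q, gamma(1/2) = p_mid, gamma(0) = p.
    Parameter vectors are lists of complex numbers of equal length."""
    if t > 0.5:
        return [(2 * t - 1) * qi + 2 * (1 - t) * mi for qi, mi in zip(q, p_mid)]
    return [2 * t * mi + (1 - 2 * t) * pi for mi, pi in zip(p_mid, p)]

p = [1.0, 0.0]                      # hypothetical target parameters
q = [0.0, 1.0]                      # hypothetical start parameters
p_mid = [0.3 + 0.7j, -0.2 + 0.4j]   # hypothetical non-real midpoint
print(gamma(1.0, p, q, p_mid), gamma(0.5, p, q, p_mid), gamma(0.0, p, q, p_mid))
```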

Proof Since N > 2, the discriminant $\Delta \subsetneq \mathbb{C}^k$ is a proper complex subvariety. Therefore, $\Delta$
is of complex codimension at least 1, hence of real codimension at least 2. This was a recap
of Remark 3.23. It implies that, for general parameters $p_{\mathrm{mid}} \in \mathbb{C}^k$, the path $\gamma(t)$ does not
intersect $\Delta$ for $t \in (0, 1]$. This proves the first item.

Since $q = \gamma(1) \notin \Delta$, the Implicit Function Theorem ensures that there exists a Euclidean
neighborhood $U_0 \subset \mathbb{C}^k$ of q and a smooth solution map $s_0 : U_0 \to \mathbb{C}^n$ such that
$F(s_0(p), p) = 0$ for all $p \in U_0$. Let

$$t_0 := \min\{t \in [0, 1] \mid \gamma(t) \in \overline{U_0}\},$$


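where $\overline{U_0}$ is the Euclidean closure of $U_0$. If $t_0 > 0$, then $\gamma(t_0) \notin \Delta$ and we can repeat the
construction for the new start system $F(x; \gamma(t_0))$. Eventually, we obtain an open cover

$$(0, 1] = \bigcup_{i \in \mathcal{I}} \gamma^{-1}(U_i),$$

for some index set $\mathcal{I}$, together with smooth solution maps $s_i : U_i \to \mathbb{C}^n$. Taking a partition
of unity $(\rho_i(t))_{i \in \mathcal{I}}$ relative to this cover (see [121, Chapter 2]), we set

$$x(t) = \sum_{i \in \mathcal{I}} \rho_i(t) \cdot (s_i \circ \gamma)(t), \qquad t \in (0, 1].$$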


The path x(t) is smooth and it satisfies $H(x(t), t) = 0$. Furthermore, as $t \to 0$, the solution
path x(t) either converges to a point z or it diverges as $\|x(t)\| \to \infty$. In the first situation,
by continuity, z is a zero of $F(x, \gamma(0)) = F(x, p)$, which is not necessarily regular.

By Theorem 3.18, the system F(x; q) has N regular zeros. The construction above yields
N solution paths $x_1(t), \ldots, x_N(t)$. Smoothness implies that $x_i(t) \neq x_j(t)$ for $i \neq j$ and all
$t \in (0, 1]$. This proves the second item. Furthermore, for every regular zero of F(x; q),
we also find a (local) solution map, which connects to exactly one of the smooth paths
$x_1(t), \ldots, x_N(t)$. This implies the third and fourth items. □


Corollary 3.28 A general system $F(x) = (f_1(x), \ldots, f_n(x))$ of n polynomials in n variables
with $d_i = \deg f_i$ has

$$N = d_1 \cdots d_n$$

isolated zeros in $\mathbb{C}^n$. (The number $d_1 \cdots d_n$ is called the Bézout number of F.)


Proof We consider the family ℱ_Bézout of polynomial systems F(x) = (f₁(x), ..., fₙ(x)) 
with dᵢ = deg fᵢ. The parameters are the coefficients of the polynomials f₁, ..., fₙ. Here, 
we can use the start system 


$$ G(x) \,=\, \begin{pmatrix} x_1^{d_1} - 1 \\ \vdots \\ x_n^{d_n} - 1 \end{pmatrix}. $$

This system has d₁ ⋯ dₙ distinct complex zeros, namely (ξ₁^{k₁}, ..., ξₙ^{kₙ}), where ξᵢ := 
exp(2π√−1 / dᵢ) is a primitive dᵢ-th root of unity and kᵢ ranges from 1 to dᵢ. One calls G(x) 
the total degree start system. All its zeros are regular, and it has no zeros at infinity. 
Together with Proposition 3.27 this implies that the system G(x) has the maximal number 
N = d₁ ⋯ dₙ of regular zeros in ℱ_Bézout. □ 
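
The zeros of the total degree start system are easy to enumerate explicitly. The following is a minimal sketch in Python, using only the standard library. The function names are our own choices for illustration and do not come from any homotopy continuation package. It lists all tuples of roots of unity and checks numerically that each one is a regular zero of G.

```python
import cmath
from itertools import product


def total_degree_start_solutions(degrees):
    """All zeros of G(x) = (x_1^{d_1} - 1, ..., x_n^{d_n} - 1)."""
    roots_per_variable = [
        [cmath.exp(2j * cmath.pi * k / d) for k in range(1, d + 1)]
        for d in degrees
    ]
    return list(product(*roots_per_variable))


def residual(point, degrees):
    """Largest value |x_i^{d_i} - 1| over the coordinates of a candidate zero."""
    return max(abs(x ** d - 1) for x, d in zip(point, degrees))


def jacobian_determinant(point, degrees):
    """The Jacobian of G is diagonal with entries d_i * x_i^(d_i - 1)."""
    det = 1
    for x, d in zip(point, degrees):
        det *= d * x ** (d - 1)
    return det


degrees = (2, 2, 1)
solutions = total_degree_start_solutions(degrees)
print(len(solutions), "start solutions for degrees", degrees)
```

Since every coordinate of a zero has absolute value one, the Jacobian determinant has absolute value d₁ ⋯ dₙ, so it never vanishes. This is the regularity claimed in the proof.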


Remark 3.29 Corollary 3.28 only states that the number of solutions equals d₁ ⋯ dₙ for 
systems outside the discriminant. The full version of Bézout’s theorem also applies to 
systems F ∈ Δ, where it states that the number of zeros counted with multiplicities is 
d₁ ⋯ dₙ. Corollary 3.28 does not prove this full version. 


Corollary 3.28 implies that one can use the total degree start system for homotopy 
continuation in the family of systems of polynomials with fixed degree pattern. 


Example 3.30 The system F(x, y, z) = (x² + y² + z² − 1, x² + z² − y, x − z) from Example 3.24 
consists of three polynomials of degrees d₁ = 2, d₂ = 2 and d₃ = 1. The number of zeros 
of F is the Bézout number N = d₁ · d₂ · d₃ = 4. In HomotopyContinuation.jl [31], we 
can use the total degree start system by setting the following flag: 


solve(F; start_system = :total_degree) 


The default option in HomotopyContinuation.jl is the polyhedral homotopy. Here the 
start system is constructed from Newton polytopes, and it respects the mixed volume. ⋄ 
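
For this particular system, the count N = 4 can be confirmed by hand. Substituting z = x turns the second equation into y = 2x², and the first equation then becomes y² + y − 1 = 0. The sketch below carries out this elimination in Python with the standard library. It is not a homotopy continuation computation, and the function names are hypothetical.

```python
import cmath


def system(x, y, z):
    """The three polynomials of Example 3.30 evaluated at (x, y, z)."""
    return (x * x + y * y + z * z - 1, x * x + z * z - y, x - z)


def solve_by_elimination():
    """Zeros of the system, obtained from z = x, y = 2 x^2 and y^2 + y - 1 = 0."""
    zeros = []
    for sign_y in (+1, -1):
        y = (-1 + sign_y * cmath.sqrt(5)) / 2
        for sign_x in (+1, -1):
            x = sign_x * cmath.sqrt(y / 2)
            zeros.append((x, y, x))
    return zeros


zeros = solve_by_elimination()
for zero in zeros:
    print(zero, max(abs(value) for value in system(*zero)))
```

Two of the four zeros are real and two are complex, which is consistent with the Bézout number counting complex solutions.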


We now introduce sparse systems, their mixed volume, and the polyhedral homotopy. 


Example 3.31 Let A ⊂ Nⁿ be a finite set and denote ℱ_A := { Σ_{α∈A} c_α x^α | c_α ∈ C }. 
An element in ℱ_A is called a sparse polynomial, since only the monomials with exponent 
vector in A appear. For finite subsets A₁, ..., Aₙ ⊂ Nⁿ, we consider the family 


ℱ_sparse = ℱ_{A₁} × ⋯ × ℱ_{Aₙ}. 


The parameters in this family are the coefficients of the n sparse polynomials of a system in 
ℱ_sparse. For 1 ≤ i ≤ n, let Pᵢ be the convex hull of Aᵢ. The polytope Pᵢ is the Newton polytope 
of the sparse polynomials in ℱ_{Aᵢ}. We write MV(P₁, ..., Pₙ) for the mixed volume of these n 
polytopes. The BKK Theorem [19] asserts that a general F ∈ ℱ_sparse has MV(P₁, ..., Pₙ) 
many zeros in the torus (C*)ⁿ. Assuming that such an F only has zeros with non-zero 
entries, the maximal number of regular zeros in ℱ_sparse therefore is 


N = MV(P₁, ..., Pₙ). 


If the supports Aᵢ are all the same, then there is only one Newton polytope. Let us denote 
it by P := P₁ = ⋯ = Pₙ. In this situation, which occurs frequently in practice, the mixed 
volume MV(P, ..., P) equals n! times the volume of P. This product is a nonnegative 
integer, and it is referred to as the normalized volume of P. Thus the BKK bound for 
unmixed square systems is the normalized volume of the Newton polytope. ⋄ 
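
In the plane, the mixed volume can be computed from areas alone. With the normalization used above, where MV(P, P) is twice the area of P, one has MV(P₁, P₂) = area(P₁ + P₂) − area(P₁) − area(P₂), with P₁ + P₂ the Minkowski sum. The following Python sketch implements this identity for lattice polygons given by their supports. It is an illustration for n = 2 only, with helper names of our own choosing, and it is not the algorithm used by polyhedral homotopy software.

```python
from itertools import product


def convex_hull(points):
    """Andrew's monotone chain; hull vertices in counterclockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def twice_area(points):
    """Twice the area of conv(points); an exact integer for lattice points."""
    hull = convex_hull(points)
    total = 0
    for (x1, y1), (x2, y2) in zip(hull, hull[1:] + hull[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total)


def mixed_volume_2d(support_1, support_2):
    """MV(P1, P2) = area(P1 + P2) - area(P1) - area(P2) for planar supports."""
    minkowski_sum = [(a[0] + b[0], a[1] + b[1])
                     for a, b in product(support_1, support_2)]
    return (twice_area(minkowski_sum)
            - twice_area(support_1) - twice_area(support_2)) // 2


unit_square = [(0, 0), (1, 0), (0, 1), (1, 1)]
dense_quadric = [(i, j) for i in range(3) for j in range(3) if i + j <= 2]
print(mixed_volume_2d(unit_square, unit_square))
print(mixed_volume_2d(dense_quadric, dense_quadric))
```

For two dense quadrics the result agrees with the Bézout number 2 · 2, while for two polynomials supported on the unit square the bound drops to the normalized area of the square.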


Remark 3.32 The article [98] introduced a combinatorial algorithm for computing an 
explicit start system for ℱ_sparse, called the polyhedral start system. See also [15, Section 3]. The 
use of the polyhedral start system is the default option in HomotopyContinuation.jl. 

We close this chapter with a brief discussion of two quadratic equations in two variables. 
The general system appeared in Example 3.21, where we computed the tact invariant, 
which serves as the discriminant. The nonvanishing of the tact invariant ensures that 
the two equations have four distinct complex solutions. Suppose we begin with the total 
degree start system x² = y² = 1, which has four solutions (±1, ±1). The homotopy in 
Proposition 3.27 is guaranteed to find the four solutions of the system we wish to solve. 


By contrast, suppose now that our two equations are sparse, and the system has the form 


$$ F(x,y) \,=\, \begin{pmatrix} a + bx + cy + dxy \\ \alpha + \beta x + \gamma y + \delta xy \end{pmatrix} \;\in\; \mathcal{F}_{\rm sparse}. $$


Here p = (a, b, c, d, α, β, γ, δ) ∈ C⁸ is a vector of parameters. Both polynomials in F(x, y) 
have the same Newton polytope P, namely the unit square. The normalized volume of the 
unit square equals MV(P, P) = 2. Therefore, the BKK Theorem tells us that F(x, y) = 0 has 
N = 2 solutions for general parameters p ∈ C⁸. 
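
The count N = 2 can be seen directly by elimination. Both equations are linear in y, so eliminating y leaves the quadratic (a + bx)(γ + δx) − (α + βx)(c + dx) = 0 in x. The sketch below solves this quadratic in Python. The parameter values at the bottom are arbitrary choices for illustration, and the code assumes general parameters, so that the leading coefficient and the denominator c + dx do not vanish.

```python
import cmath


def bilinear_system(p1, p2, x, y):
    """Evaluate both sparse polynomials at (x, y)."""
    a, b, c, d = p1
    al, be, ga, de = p2
    return (a + b * x + c * y + d * x * y, al + be * x + ga * y + de * x * y)


def solve_bilinear(p1, p2):
    """Zeros of the sparse system, found by eliminating y."""
    a, b, c, d = p1
    al, be, ga, de = p2
    # Coefficients of (a + b x)(ga + de x) - (al + be x)(c + d x).
    q2 = b * de - be * d
    q1 = a * de + b * ga - al * d - be * c
    q0 = a * ga - al * c
    if q2 == 0:
        raise ValueError("parameters are not general: eliminant is not quadratic")
    root = cmath.sqrt(q1 * q1 - 4 * q2 * q0)
    solutions = []
    for x in ((-q1 + root) / (2 * q2), (-q1 - root) / (2 * q2)):
        y = -(a + b * x) / (c + d * x)  # assumes c + d x != 0 (general parameters)
        solutions.append((x, y))
    return solutions


p1, p2 = (1, 2, 3, 4), (2, -1, 1, 3)
sparse_solutions = solve_bilinear(p1, p2)
for x, y in sparse_solutions:
    print(x, y, bilinear_system(p1, p2, x, y))
```

The two roots of the quadratic collide precisely when its discriminant q1² − 4 q2 q0 vanishes, which mirrors the role of a discriminant for this sparse family.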

The total degree start system x² = y² = 1 is not appropriate for the sparse family F(x, y) 
because it has too many solutions. Two of the start solutions (±1, ±1) lead to paths that 
diverge when running the homotopy. Instead, a polyhedral start system [98] has precisely two 
solutions. A polyhedral start system is obtained by dividing the square P into two triangles, 
each of normalized area 1. The two zeros of F(x, y) are distinct when the discriminant does 
not vanish at p. The discriminant of the system F(x, y) is the hyperdeterminant of format 
2 × 2 × 2, which we saw in Example 3.16. In other words, the role of the tact invariant for 
two dense quadrics is now played by our hyperdeterminant. Hyperdeterminants of larger 
tensors, and the spectral theory of tensors, will be featured in Chapter 12. 


Chapter 4 
Polar Degrees 

The notion of polar degrees is fundamental for assessing the algebraic complexity of 
polynomial optimization problems of a metric origin. We already recognized this point for 
Euclidean distance optimization in Section 2.3, and we will see it again in Theorem 5.5 for 
polyhedral norms, with focus on the Wasserstein metric from optimal transport theory. The 
punchline is that polar degrees govern linear programming over real algebraic varieties. 
This chapter offers a self-contained introduction to polar degrees, at a more leisurely 
pace, and with an emphasis on geometric intuition. We compare three definitions: in terms 
of non-transversal intersections, Schubert varieties and the Gauss map, and conormal 
varieties. The latter was used in Definition 2.16. We discuss the behavior of polar degrees 
under projective duality, and we explain how polar degrees are related to Chern classes. 
Throughout this chapter, we work over an algebraically closed field of characteristic zero. 


4.1 Polar Varieties 


Our starting point is a geometric definition of polar degrees, as the degrees of polar 
varieties. Recall from Definition 2.18 that we introduced the polar degrees δⱼ(X) of a 
projective variety X as the multidegrees of the conormal variety of X. Polar varieties offer 
an alternative realization. We will prove in Theorem 4.16 below that both multidegrees of 
the conormal variety and degrees of polar varieties yield the same definition. Until then, 
we use the symbol μᵢ(X) to denote the i-th polar degree that comes from polar varieties. 
To illustrate the concept of polar varieties, we examine the case of surfaces in 3-space. 


Example 4.1 Imagine that you look at an algebraic surface X ⊂ P³ from a point v ∈ P³. If 
you want to sketch the surface from your point of view, then you would draw its contour 
curve P(X, v). This is illustrated in Figure 4.1. The contour curve consists of all points p 
on the surface X such that the line spanned by p and v is tangent to X at p. The first polar 
degree μ₁(X) is the degree of the contour curve on X for a generic point v. 

Suppose that X is defined by a general homogeneous polynomial f(x₀, x₁, x₂, x₃) of 
degree d, and the viewpoint is v = (v₀ : v₁ : v₂ : v₃). To compute the contour curve, we 
consider the directional derivative ∂_v f = Σᵢ vᵢ ∂f/∂xᵢ, which is a polynomial of degree 
d − 1. The contour curve P(X, v) is the variety in P³ that is defined by f and ∂_v f. So, by 
Bézout’s Theorem, the degree of this curve is typically μ₁(X) = d(d − 1). 

Fig. 4.1: The polar variety P(X, v) is the contour curve when the ellipsoid X is viewed from the point v. 

Now, we change the setting slightly and imagine that our viewing of the surface X is 
not centered at a point but at a line V ⊂ P³. This time our contour set P(X, V) consists of 
all points p on the surface X such that the plane spanned by p and the line V is tangent 
to X at p; see Figure 4.2. For a generic line V, the contour set P(X, V) is finite. The cardinality 
of P(X, V) is the second polar degree μ₂(X). The finite variety P(X, V) is now defined by 
f = ∂_v f = ∂_w f = 0, where v and w are two distinct points on the line V. Using Bézout’s 
Theorem again, we find that μ₂(X) = d(d − 1)² for a general surface X of degree d. ⋄ 


Fig. 4.2: The contour set consists of two points when the ellipsoid X is viewed from the line V. 


The contour sets described in the example above are known as polar varieties. To define 
polar varieties in general, we need to fix some conventions and notations. For instance, 
the dimension of the empty set is considered to be −1. Given two projective subspaces V 
and W in Pⁿ, their projective span (equivalently, their join) is denoted by V + W ⊂ Pⁿ. If 
the two subspaces V and W are disjoint in projective space then we have 


dim(V + W) = dim(V) + dim(W) + 1. 


Consider any projective variety X ⊂ Pⁿ, with homogeneous ideal I(X) in C[x₀, ..., xₙ]. 
We write Reg(X) for the regular locus of X. We recall that a projective subspace W ⊂ Pⁿ 
is said to intersect X non-transversely at p ∈ Reg(X) if p ∈ W and dim(W + T_pX) < n, 
where T_pX denotes the embedded tangent space of X at p. For instance, if X is a smooth 
curve in P³, then every line that intersects it does so non-transversely, while the tangent 
planes of X are the only planes that meet X non-transversely. 


Definition 4.2 The polar variety of a variety X ⊂ Pⁿ with respect to a subspace V ⊂ Pⁿ is 
P(X, V) := { p ∈ Reg(X) \ V | V + p intersects X at p non-transversely }. 


Let i ∈ {0, 1, ..., dim X}. If V is generic with dim V = codim X − 2 + i, then the degree of 
P(X, V) is independent of V, and we can define 


μᵢ(X) := deg(P(X, V)). 
The integer μᵢ(X) is called the i-th polar degree of X. 


Example 4.3 A surface X ⊂ P³ has three polar degrees. For i = 0, 1, 2, the generic 
subspace V in Definition 4.2 is empty, a point, or a line, respectively. We saw the latter two 
cases in Example 4.1. For the case i = 0, we observe that P(X, ∅) = X. So, the 0-th polar 
degree μ₀(X) is the degree of the surface X. ⋄ 


Example 4.4 The identity μ₀(X) = deg(X) holds in general. If i = 0, then the dimension 
of the generic subspace V in Definition 4.2 is codim X − 2. Hence, for every p ∈ Reg(X), 


$$ \begin{aligned}
\dim\bigl((V + p) + T_pX\bigr) \,&=\, \dim(V + T_pX) \\
&=\, \dim V + \dim X - \dim(V \cap T_pX) \\
&\le\, ({\rm codim}\, X - 2) + \dim X + 1 \\
&=\, n - 1.
\end{aligned} $$


This means that V + p intersects X at p non-transversely. We conclude that the polar variety 
for i = 0 is the variety itself. In symbols, we have P(X, V) = X, and so μ₀(X) = deg(X). ⋄ 


Example 4.5 The polar varieties of a plane curve X are P(X, ∅) = X and P(X, v) for a 
point v. The latter is the finite set of points on X whose tangent line passes through v. Those 
points are contained in the vanishing locus of ∂_v f, where f is the defining polynomial of 
the curve X and ∂_v is the differential operator in (1.6). Hence, the polar variety P(X, v) 
is contained in the intersection of X with its polar curve of order deg(X) − 1 as defined 
in Section 1.1. For a smooth curve X of degree d, that containment is an equality, and so 
μ₁(X) = |P(X, v)| = d(d − 1) by Bézout’s Theorem. ⋄ 


We will now give a second definition of polar varieties in terms of the Gauss map and 
Schubert varieties. As before, we fix a projective subspace V ⊂ Pⁿ. We observe that V + p 
intersects X at p ∈ Reg(X) non-transversely (i.e., n > dim((V + p) + T_pX)) if and only if 
dim(V ∩ T_pX) > dim V − codim X. (4.1) 


Since dim V − codim X is the expected dimension of the intersection of the two projective 
subspaces V and T_pX, condition (4.1) means that the tangent space T_pX meets V in an 
unexpectedly large dimension. 

The subspaces that meet V unexpectedly form special instances of Schubert varieties: 


Σ_m(V) := { T ∈ Gr(m, Pⁿ) | dim(V ∩ T) > dim V − n + m }. (4.2) 


Here Gr(m, Pⁿ) is the Grassmannian of m-dimensional projective subspaces in Pⁿ. This is 
a smooth projective variety of dimension (m + 1)(n − m), embedded in $\mathbb{P}^{\binom{n+1}{m+1}-1}$ via the 
Plücker embedding; see [133, Chapter 5]. For instance, if m = 1 and n = 3, we get the 
Grassmannian of lines in P³. As a projective variety, the Grassmannian Gr(1, P³) consists 
of all points x = (x₀₁ : x₀₂ : x₀₃ : x₁₂ : x₁₃ : x₂₃) in the 5-dimensional ambient space P⁵ 
that satisfy the quadratic Plücker relation x₀₁x₂₃ − x₀₂x₁₃ + x₀₃x₁₂ = 0. Such a point x 
represents the line spanned by points p and q in P³ if x_ij = pᵢqⱼ − pⱼqᵢ for 0 ≤ i < j ≤ 3. 

Every variety X of dimension m in Pⁿ comes with a natural map to the Grassmannian: 


γ_X : X ⇢ Gr(m, Pⁿ), p ↦ T_pX. 


This is the Gauss map. It takes each regular point p on X to its tangent space T_pX. If X 
is given by polynomial equations, then we compute γ_X by mapping p to the kernel of the 
Jacobian of the equations at p. If X is given by a parametrization, then we can represent 
the Gauss map γ_X by taking derivatives in the parametrization. 


Example 4.6 (Twisted cubic curve) Let X be the curve in P³ with the parametrization 
ν : P¹ → P³, (s : t) ↦ (s³ : s²t : st² : t³). 


The tangent space of X at the point p = ν((s : t)) is the line in P³ spanned by the rows of 
the Jacobian matrix 


$$ \begin{pmatrix} 3s^2 & 2st & t^2 & 0 \\ 0 & s^2 & 2st & 3t^2 \end{pmatrix}. $$

The Plücker coordinates of the tangent line γ_X(p) are the 2 × 2 minors of this 2 × 4 matrix. 
After dividing the minors by 3, we obtain 
x₀₁ = s⁴, x₀₂ = 2s³t, x₀₃ = 3s²t², x₁₂ = s²t², x₁₃ = 2st³, x₂₃ = t⁴. (4.3) 
This shows that the image of the Gauss map of the twisted cubic curve X is a rational 
normal curve of degree four in the Grassmannian of lines Gr(1, P³) ⊂ P⁵. ⋄ 


We now turn to the Schubert varieties Σ_m(V). These varieties are cut out by linear 
equations in Plücker coordinates. 


Example 4.7 (m = 1, n = 3) We consider three types of Schubert varieties of lines. If V is 
a point in P³, then Σ₁(V) = Gr(1, P³). If V is a plane in P³, then Σ₁(V) is the surface of 
all lines in that plane. Finally, suppose that V is a line in P³, with Plücker coordinates v_ij. 
Then Σ₁(V) is defined by the linear equation 


v₀₁x₂₃ − v₀₂x₁₃ + v₀₃x₁₂ + v₁₂x₀₃ − v₁₃x₀₂ + v₂₃x₀₁ = 0. (4.4) 


This Schubert variety is a threefold. It consists of all lines that intersect the given line V. ⋄ 

We now come to our punchline, which is the second definition of polar varieties. 


Proposition 4.8 Fix a variety X of dimension m in Pⁿ. For any subspace V of Pⁿ, the polar 
variety P(X, V) is the pullback of the Schubert variety Σ_m(V) under the Gauss map, i.e. 


P(X, V) = γ_X⁻¹(Σ_m(V)). 


Proof The condition (4.1) is equivalent to T_pX ∈ Σ_m(V). □ 


Example 4.9 (Twisted cubic curve) In Example 4.7, if V is a point in P³, then we have 
P(X, V) = γ_X⁻¹(Gr(1, P³)) = X. For a line V, we compute P(X, V) = γ_X⁻¹(Σ₁(V)) by 
substituting (4.3) into (4.4). The result is a binary quartic in (s, t). This quartic has four 
zeros in P¹. Their image under the map ν is the polar variety P(X, V). Thus P(X, V) 
consists of four points. The polar degrees of our curve X are μ₀(X) = 3 and μ₁(X) = 4. ⋄ 
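
The computations in Examples 4.6 and 4.9 can be replayed with exact integer arithmetic. The following Python sketch recomputes the Plücker coordinates (4.3) from the Jacobian, checks the Plücker relation, and expands the substitution of (4.3) into (4.4) as a binary quartic. All function names are hypothetical, and the coefficient list in `quartic_coefficients` is our own expansion rather than a formula quoted from the text.

```python
from itertools import combinations

PAIRS = list(combinations(range(4), 2))


def jacobian_rows(s, t):
    """Rows of the Jacobian of (s^3, s^2 t, s t^2, t^3) with respect to s and t."""
    return [(3 * s * s, 2 * s * t, t * t, 0), (0, s * s, 2 * s * t, 3 * t * t)]


def line_through(p, q):
    """Pluecker coordinates x_ij = p_i q_j - p_j q_i of the line spanned by p and q."""
    return {(i, j): p[i] * q[j] - p[j] * q[i] for i, j in PAIRS}


def pluecker_coordinates(s, t):
    """Tangent line of the twisted cubic: 2x2 minors of the Jacobian, divided by 3."""
    r0, r1 = jacobian_rows(s, t)
    return {ij: minor // 3 for ij, minor in line_through(r0, r1).items()}


def formula_4_3(s, t):
    return {(0, 1): s**4, (0, 2): 2 * s**3 * t, (0, 3): 3 * s**2 * t**2,
            (1, 2): s**2 * t**2, (1, 3): 2 * s * t**3, (2, 3): t**4}


def schubert_condition(v, x):
    """Left-hand side of the linear equation (4.4)."""
    return (v[0, 1] * x[2, 3] - v[0, 2] * x[1, 3] + v[0, 3] * x[1, 2]
            + v[1, 2] * x[0, 3] - v[1, 3] * x[0, 2] + v[2, 3] * x[0, 1])


def quartic_coefficients(v):
    """Coefficients of s^4, s^3 t, s^2 t^2, s t^3, t^4 after substituting (4.3) into (4.4)."""
    return (v[2, 3], -2 * v[1, 3], v[0, 3] + 3 * v[1, 2], -2 * v[0, 2], v[0, 1])


V = line_through((1, 2, 0, -1), (0, 1, 3, 5))  # an arbitrary line in P^3
print(quartic_coefficients(V))
```

For a generic line V the five coefficients are nonzero, so the quartic has four roots in P¹, in agreement with μ₁(X) = 4.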


Corollary 4.10 An inclusion of projective linear subspaces gives a reverse inclusion of 
polar varieties. To be precise, for any fixed variety X in Pⁿ and subspaces V ⊂ V′, we 
have P(X, V′) ⊂ P(X, V). 


Proof Use Proposition 4.8 and the inclusion Σ_m(V′) ⊂ Σ_m(V) of Schubert varieties. □ 


4.2 Projective Duality 


Duality is a fundamental concept in many fields of mathematics, including convexity, 
optimization, and algebraic geometry. Primal-dual approaches in polynomial optimization 
can be understood via the duality theory of projective algebraic geometry. We saw this in 
Chapter 2, in the context of polynomial optimization, where the conormal variety plays a 
key role for computing the ED degree in Theorem 2.13. 

This section offers a second point of entry, by offering a self-contained introduction to 
duality theory of projective varieties, presented in a friendlier and more geometric way than 
in Section 2.3. There is a one-to-one correspondence between hyperplanes H in Pⁿ and 
points H^∨ in the dual projective space (Pⁿ)*. We recall from our discussion in Section 2.3 
that the dual variety X^∨ of a projective variety X ⊂ Pⁿ is the projection of the conormal 
variety N_X to the second factor. It parametrizes all tangent hyperplanes: 


X^∨ := { H^∨ ∈ (Pⁿ)* | ∃ p ∈ Reg(X) : T_pX ⊆ H }. 


This generalizes the dual of a plane curve in (1.7). In Chapter 2 we used the Euclidean 
inner product to identify projective space with its dual space. Here, we defined the dual 
variety without this identification. The following result is found in Section I.1.3 of the book 
by Gel’fand, Kapranov and Zelevinsky [71]. 


Theorem 4.11 (Biduality) Fix a projective variety X ⊂ Pⁿ, and let p ∈ Reg(X) and 
H^∨ ∈ Reg(X^∨). The hyperplane H is tangent to the variety X at the point p if and only if 
the hyperplane p^∨ is tangent to the dual variety X^∨ at the point H^∨. In particular, we have 


(X^∨)^∨ = X. 


Example 4.12 If X is a projective subspace of Pⁿ, given as the row space of a matrix, 
then its dual X^∨ is the subspace of (Pⁿ)* given by the kernel of that matrix, so dim X^∨ = 
n − 1 − dim X. Note that (Pⁿ)^∨ = ∅. ⋄ 


Example 4.13 (Twisted cubic curve) Let n = 3 and fix coordinates p = (p₀ : p₁ : p₂ : p₃) 
on P³ and H = (h₀ : h₁ : h₂ : h₃) on (P³)*. The twisted cubic curve X in Example 4.6 is 
defined by the prime ideal 


⟨ p₀p₂ − p₁², p₀p₃ − p₁p₂, p₁p₃ − p₂² ⟩. 


Its dual variety X^∨ is a quartic surface in (P³)*. The equation of X^∨ is the discriminant of 
a binary cubic: 
27h₀²h₃² − 18h₀h₁h₂h₃ + 4h₀h₂³ + 4h₁³h₃ − h₁²h₂². (4.5) 


Metric geometry of discriminants will become important in Chapter 9. At this point, our 
readers are encouraged to check computationally that (X^∨)^∨ = X. ⋄ 
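
One half of this check is elementary. A plane H meets the twisted cubic in the roots of the binary cubic h₀s³ + h₁s²t + h₂st² + h₃t³, and H contains a tangent line exactly when this cubic has a double root. The sketch below verifies in exact integer arithmetic that (4.5) vanishes on such planes. It is a sanity check under these assumptions, not a computation of the dual variety, and the helper names are our own.

```python
def discriminant_4_5(h0, h1, h2, h3):
    """The quartic polynomial (4.5) in the dual coordinates."""
    return (27 * h0**2 * h3**2 - 18 * h0 * h1 * h2 * h3
            + 4 * h0 * h2**3 + 4 * h1**3 * h3 - h1**2 * h2**2)


def cubic_with_double_root(a, b):
    """Coefficients (h0, h1, h2, h3) of (s - a t)^2 (s - b t)."""
    return (1, -(2 * a + b), a * a + 2 * a * b, -a * a * b)


def contains_tangent_line(h, a):
    """Does the plane h contain the tangent line of the curve at the point (a : 1)?"""
    row_s = (3 * a * a, 2 * a, 1, 0)   # derivative of the parametrization in s
    row_t = (0, a * a, 2 * a, 3)       # derivative of the parametrization in t
    return (sum(x * y for x, y in zip(h, row_s)) == 0
            and sum(x * y for x, y in zip(h, row_t)) == 0)


h = cubic_with_double_root(1, 2)
print(h, discriminant_4_5(*h), contains_tangent_line(h, 1))
```

A plane whose cubic has three distinct roots, such as s³ − st², gives a nonzero value of (4.5).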


Example 4.14 Fix a surface X ⊂ P³. Figure 4.2 illustrates the polar variety P(X, V) for 
a generic line V. The tangent planes through V correspond in (P³)* to points on the dual 
variety X^∨ that lie on the line V^∨; see Figure 4.3. Hence, if X^∨ is a surface as well, then 
its degree is the second polar degree of X, i.e., μ₂(X) = deg(X^∨). Otherwise, if the dual 
variety X^∨ is a curve, then the line V^∨ misses it and μ₂(X) = 0. This happens, for instance, 
when X is the quartic surface (4.5), now rewritten in primal coordinates p. 


Fig. 4.3: A pair of dual surfaces X and X^∨ in projective 3-space satisfies μ₂(X) = deg(X^∨). 


Figure 4.1 shows the polar curve P(X, v) of a surface X ⊂ P³ for a generic point v. This 
consists of all points p on X whose tangent plane contains v. We find μ₁(X) = deg(P(X, v)) 
by intersecting the curve with a generic plane H. Hence, the first polar degree μ₁(X) counts 
all regular points p ∈ X such that 


p ∈ H and v ∈ T_pX. (4.6) 

These tangent planes correspond to points q := (T_pX)^∨ on the dual variety X^∨. Using the 
biduality in Theorem 4.11, we have T_qX^∨ = p^∨ if the dual variety X^∨ is a surface. In that 
case, the conjunction in (4.6) is equivalent to 


H^∨ ∈ T_qX^∨ and q ∈ v^∨. (4.7) 


Fig. 4.4: The identity μ₁(X) = μ₁(X^∨) holds for pairs of dual surfaces in projective 3-space. 


Comparing (4.7) with (4.6), we see that the point-plane pair (H^∨, v^∨) imposes the same 
conditions on points q ∈ X^∨ as the point-plane pair (v, H) imposes on points p ∈ X; 
see Figure 4.4. By the genericity of (v, H), we conclude that μ₁(X) = μ₁(X^∨) if X^∨ is a 
surface. If X^∨ is a curve, then T_qX^∨ ⊊ p^∨ and so the only condition imposed by (4.6) on the 
points q ∈ X^∨ is that they must lie in v^∨. Here, μ₁(X) = |X^∨ ∩ v^∨| = deg(X^∨) = μ₀(X^∨). 
Finally, if X^∨ is a point (i.e., X is a plane), then μ₁(X) = 0. ⋄ 


The observed relations between the polar degrees of a variety and its dual are true 
in great generality. The sequence of polar degrees of a projective variety X equals the 
sequence of polar degrees of its dual variety X^∨ in reversed order. As seen in Example 4.4, 
the first non-zero entry is deg(X), and its last non-zero entry is the degree of the dual X^∨. 

We summarize these key properties of polar degrees: 


Theorem 4.15 Let X be an irreducible projective variety, and 
α(X) := dim X − codim X^∨ + 1. 


The following hold for the polar degrees of X: 


(a) μᵢ(X) > 0 ⇔ 0 ≤ i ≤ α(X). 
(b) μ₀(X) = deg X. 
(c) μ_{α(X)}(X) = deg X^∨. 
(d) μᵢ(X) = μ_{α(X)−i}(X^∨). 

The ideas discussed in Example 4.14 can be turned into formal proofs for almost all 
assertions in Theorem 4.15. The direction “⇐” in (a) is a bit tricky. For details, we refer to 
the article by Holme [91]. Another proof strategy is to first establish the relation of the polar 
degrees with the conormal variety N_X (see Definition 2.16). Recall from Definition 2.18 
that we have defined the polar degrees of X to be the multidegrees of N_X. More precisely, 
for generic projective subspaces L₁ and L₂, we have 


δⱼ(X) := |N_X ∩ (L₁ × L₂)|, where dim L₁ = n + 1 − j and dim L₂ = j. 


We saw in Section 2.3 that the multidegree is the cohomology class of the conormal variety. 


Theorem 4.16 The multidegree agrees with the polar degrees. To be precise, we have 
δⱼ(X) = μᵢ(X), where i := dim X + 1 − j. 


This theorem and the biduality relation N_X = N_{X^∨} in (2.15) imply Theorem 4.15 (d). 
Using Example 4.4, they also imply (b) and (c) in Theorem 4.15. The direction “⇒” in 
Theorem 4.15 (a) can also be deduced directly from the definition of the δⱼ(X). 


Example 4.17 We see from Figure 4.4 and conditions (4.6) and (4.7) that the first polar 
degree of a surface X in P³ equals μ₁(X) = |N_X ∩ (H × v^∨)| = δ₂(X). Hence Theorem 4.16 
holds for surfaces X in P³. ⋄ 


Proof (of Theorem 4.16) We present the main ideas of the proof. For complete proofs, we 
refer to [106, Proposition 3 on page 187] or [70, Lemma 2.23 on page 169]. 

First, we remark that the formulation of the conormal variety in Chapter 2 used the 
Euclidean inner product to identify projective space with its dual. In this chapter, we do 
not use the Euclidean structure. Instead, we formulate the relevant notions using abstract 
duality. The conormal variety is 


N_X = { (p, H^∨) ∈ Pⁿ × (Pⁿ)* | p ∈ Reg(X), T_pX ⊆ H }. 


The projection of the conormal variety N_X onto the first factor Pⁿ is the variety X we 
started with. The projection onto the second factor (Pⁿ)* is the dual variety X^∨. 

We compute the multidegrees of N_X. Let L₁ ⊂ Pⁿ and L₂ ⊂ (Pⁿ)* be generic subspaces 
of dimensions n + 1 − j and j, respectively. Set V := L₂^∨. Note that the subspace V has the 
correct dimension for the computation of the i-th polar degree, where i = dim X + 1 − j. This 
follows from dim V = n − j − 1 = codim X − 2 + i. We now consider a generic point (p, H^∨) 
in the intersection N_X ∩ (Pⁿ × L₂). Then p is a regular point of X. Both its tangent space T_pX 
and V = L₂^∨ are contained in the hyperplane H. In particular, we have dim(V + T_pX) < n. 
Hence p is in the polar variety P(X, V). In fact, the projection N_X ∩ (Pⁿ × L₂) → P(X, V) 
onto the first factor is birational. Hence, 


μᵢ(X) = deg(P(X, V)) = |P(X, V) ∩ L₁| = |N_X ∩ (L₁ × L₂)| = δⱼ(X), 


which yields the stated formula. □ 


Many varieties in applications are defined by rank constraints on matrices or tensors. 
Determinantal varieties and Segre–Veronese varieties will make an appearance in several 
subsequent chapters. It is thus worthwhile to examine their polar varieties and polar degrees. 

Example 4.18 (A Segre variety) We revisit Example 2.22, where an explicit multidegree 
was computed. Let X be the variety of 3 × 3 matrices of rank 1. This is a 4-dimensional 
smooth subvariety of P⁸. As an abstract variety, we have X ≅ P² × P². The prime ideal of X 
is generated by the nine 2 × 2 minors of the 3 × 3 matrix. The prime ideal of the conormal 
variety N_X is the ideal I defined in the Macaulay2 fragment in Example 2.22. 

In light of Theorem 4.16, the polar degrees of the Segre variety X are 


μ₀(X) = 6, μ₁(X) = 12, μ₂(X) = 12, μ₃(X) = 6, μ₄(X) = 3. 


Here α(X) = 4. The dual variety X^∨ is a cubic hypersurface in P⁸. Its defining 
polynomial is the 3 × 3-determinant. This was called minors(3,h) in Example 2.22. By 
Theorem 4.15 (d), we have 


μ₀(X^∨) = 3, μ₁(X^∨) = 6, μ₂(X^∨) = 12, μ₃(X^∨) = 12, μ₄(X^∨) = 6. 


Can you describe the polar varieties P(X, V) and P(X^∨, V) in the language of linear 
algebra? Which matrices do they contain for a given linear subspace V of the matrix space 
R^{3×3}? How about symmetric 3 × 3 matrices? ⋄ 
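
The bookkeeping in Theorem 4.15 is simple enough to script. The following Python sketch takes the polar degrees of the Segre variety listed above, computes α(X), and reverses the sequence to obtain the polar degrees of the dual hypersurface. The function names are hypothetical, and the numbers are those stated in this example rather than computed from the ideal.

```python
def alpha(dim_variety, codim_dual):
    """The index alpha(X) = dim X - codim X^dual + 1 from Theorem 4.15."""
    return dim_variety - codim_dual + 1


def dual_polar_degrees(polar_degrees):
    """Theorem 4.15 (d): mu_i(X) = mu_{alpha - i}(X^dual) reverses the nonzero entries."""
    return list(reversed(polar_degrees))


segre_polar_degrees = [6, 12, 12, 6, 3]           # mu_0(X), ..., mu_4(X)
segre_alpha = alpha(dim_variety=4, codim_dual=1)  # the dual is a hypersurface

print(segre_alpha)
print(dual_polar_degrees(segre_polar_degrees))
print(sum(segre_polar_degrees))
```

The last line prints the sum of the polar degrees. Under the hypotheses of Theorem 2.13, this sum is the ED degree of X.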


4.3 Chern Classes 


In what follows we take a look at a concept that is ubiquitous in differential geometry, 
algebraic geometry, and algebraic topology. These three fields are now conspiring to furnish 
mathematical foundations for data science. It is thus natural for our book to devote one 
section to an important theoretical topic in their intersection. 

Chern classes are topological invariants associated with vector bundles on smooth 
manifolds or varieties. If X is a smooth and irreducible projective variety, then the polar 
degrees μᵢ(X) can be computed from the Chern classes of X. The formulas will be presented 
in Theorem 4.20. In light of Theorems 2.13 and 5.5, we can obtain formulas, in terms of 
Chern classes, for both ED degrees and Wasserstein degrees. 

To a vector bundle ℰ of rank r on a variety X, we associate the Chern classes 
c₀(ℰ), ..., c_r(ℰ). Formally, these are elements in the Chow ring of X. Chern classes 
are understood more easily when the vector bundle ℰ is globally generated. Assuming that 
this holds for ℰ, we choose general global sections σ₁, ..., σⱼ : X → ℰ. These j global 
sections determine the following subvariety of the given variety X: 


D(σ₁, ..., σⱼ) := { x ∈ X | σ₁(x), ..., σⱼ(x) are linearly dependent }. (4.8) 


Experts refer to (4.8) as a degeneracy locus of the vector bundle ℰ. In concrete scenarios, 
D(σ₁, ..., σⱼ) is the determinantal variety given by the maximal minors of a certain r × j 
matrix with linear entries. We are interested in the class of this variety in the Chow ring. 


Definition 4.19 The Chern class c_{r+1−j}(ℰ) is the class of (4.8) in the Chow ring of X. 

For the purpose of this section, it is not necessary for the reader to master the definition 
of the Chow ring. It suffices for us to understand the degree of c_{r+1−j}(ℰ). This degree is a 
number, not a class. It is defined as the degree of the degeneracy locus D(σ₁, ..., σⱼ) as 
a projective variety, for general sections σᵢ of ℰ. For instance, the degree of the top Chern 
class c_r(ℰ) is the degree of the vanishing locus of a single general global section. 

There are some calculation rules that allow us to compute Chern classes of a vector 
bundle of interest in terms of simpler vector bundles. Most notable is the Whitney sum 
formula [69, Theorem 3.2]. This applies when we have a short exact sequence of vector 
bundles 0 → ℰ′ → ℰ → ℰ″ → 0. It states that 


$$ c_k(\mathcal{E}) \,=\, \sum_{i+j=k} c_i(\mathcal{E}')\, c_j(\mathcal{E}''). $$

Every smooth, irreducible variety X has a distinguished vector bundle, namely the 
tangent bundle TX. The Chern class c_k(X) of X is an abbreviation for the Chern class c_k(TX). 


Theorem 4.20 ([91, Equation (3)]) Let X be a smooth, irreducible projective variety, and 
let m := dim X. Then, 


$$ \mu_i(X) \,=\, \sum_{k=0}^{i} (-1)^k \binom{m-k+1}{m-i+1} \deg(c_k(X)). \qquad (4.9) $$

This formula can be inverted to write degrees of Chern classes in terms of polar degrees: 


$$ \deg(c_k(X)) \,=\, \sum_{i=0}^{k} (-1)^i \binom{m-i+1}{m-k+1} \mu_i(X). \qquad (4.10) $$
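
Both formulas are finite alternating sums, so they are easy to test on examples. The sketch below implements (4.9) and (4.10) in Python and applies them to a smooth hypersurface of degree d in Pⁿ. For the input we assume the standard fact, not derived in this chapter, that deg(c_k(X)) is d times the coefficient of hᵏ in (1 + h)ⁿ⁺¹/(1 + dh). The function names are hypothetical.

```python
from math import comb


def polar_from_chern(chern_degrees):
    """Formula (4.9); chern_degrees[k] = deg c_k(X) for k = 0, ..., m."""
    m = len(chern_degrees) - 1
    return [sum((-1) ** k * comb(m - k + 1, m - i + 1) * chern_degrees[k]
                for k in range(i + 1))
            for i in range(m + 1)]


def chern_from_polar(polar_degrees):
    """Formula (4.10); polar_degrees[i] = mu_i(X) for i = 0, ..., m."""
    m = len(polar_degrees) - 1
    return [sum((-1) ** i * comb(m - i + 1, m - k + 1) * polar_degrees[i]
                for i in range(k + 1))
            for k in range(m + 1)]


def hypersurface_chern_degrees(d, n):
    """Assumed input: deg c_k(X) = d * [h^k] (1 + h)^(n+1) / (1 + d h)."""
    return [d * sum(comb(n + 1, j) * (-d) ** (k - j) for j in range(k + 1))
            for k in range(n)]


chern = hypersurface_chern_degrees(d=3, n=3)   # a smooth cubic surface
print(chern)
print(polar_from_chern(chern))
print(chern_from_polar(polar_from_chern(chern)))
```

For a surface of degree d in P³, the output of `polar_from_chern` reproduces the values d, d(d − 1), d(d − 1)² from Example 4.1, and applying (4.10) afterwards returns the original Chern degrees.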


Remark 4.21 Both formulas also hold for singular varieties, after replacing the classical 
Chern classes with Chern–Mather classes. That result is due to Piene [145]. 


An important difference between polar degrees and Chern classes is the following: Polar 
degrees are projective invariants of the embedded variety X ⊂ Pⁿ. This holds also more 
generally for the polar classes, i.e., the rational equivalence classes (in the Chow ring of X) 
of the polar varieties. Chern classes are even intrinsic invariants of the variety X, i.e., they 
do not depend on the embedding of X in projective space. 

Example 4.22 Here are some basic facts about the Chern classes of a smooth variety X. 
(a) We see from (4.10) that deg(c₀(X)) = μ₀(X) = deg X. 
(b) The degree of the top Chern class of X equals its topological Euler characteristic: 
deg(c_m(X)) = χ(X), m = dim X. 


(c) If X is a curve of genus g(X), then deg(c₁(X)) = χ(X) = 2 − 2g(X) is independent 
of the embedding, while we see from (4.9) that 


μ₁(X) = 2 deg X − deg(c₁(X)) = 2(deg X + g(X) − 1), (4.11) 


which does depend on the embedding of X. ⋄ 

Example 4.23 If X ⊂ Pⁿ is a rational curve, then μ₁(X) = 2 deg X − 2 by (4.11). We can 
check this by examining the various cases: 


• If X is a line, then its dual variety X^∨ is never a hypersurface, and so μ₁(X) = 0. 

• If X is a conic (i.e., deg X = 2), then X^∨ is a (cone over a) conic, and so we have 
that μ₁(X) = deg X^∨ = 2. 

• If X is a twisted cubic (i.e., deg X = 3), then X^∨ is (a cone over) the discriminant in 
(4.5). Therefore, μ₁(X) = deg X^∨ is the degree of that discriminant, which is 4. See 
Example 4.13. 

• If X is a rational normal curve of degree d, then its dual variety is (a cone over) the 
discriminant of a binary form of degree d. Its degree is μ₁(X) = deg X^∨ = 2d − 2. ⋄ 
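
Formula (4.11) can be tabulated for the cases above. The Python sketch below does this, and it also compares (4.11) with the count d(d − 1) from Example 4.5 for smooth plane curves. For that comparison we assume the classical genus formula g = (d − 1)(d − 2)/2 for a smooth plane curve of degree d, which is not stated in this chapter. The function names are our own.

```python
def first_polar_degree_of_curve(degree, genus):
    """Formula (4.11): mu_1(X) = 2 (deg X + g(X) - 1) for a smooth curve X."""
    return 2 * (degree + genus - 1)


def genus_of_smooth_plane_curve(d):
    """Assumed classical genus formula for a smooth plane curve of degree d."""
    return (d - 1) * (d - 2) // 2


for d in range(1, 6):
    rational = first_polar_degree_of_curve(d, genus=0)
    plane = first_polar_degree_of_curve(d, genus_of_smooth_plane_curve(d))
    print(d, rational, plane)
```

The rational column lists 2d − 2, matching the four bullet points, and the plane curve column lists d(d − 1).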


We close with the expression for the ED degree in terms of Chern classes. 


Corollary 4.24 Let X be a smooth variety of dimension m in Pⁿ which satisfies the 
hypotheses in Theorem 2.13. These always hold after a general linear change of coordinates. 
We have 


$$ {\rm EDdegree}(X) \,=\, \sum_{i=0}^{m} (-1)^i \bigl( 2^{m+1-i} - 1 \bigr) \deg(c_i(X)). $$


Proof Equation (2.19) in Theorem 2.23 shows that the ED degree is the degree of the 
conormal variety N_X. By Theorem 4.16, this is the sum of the multidegrees, and hence the 
sum of the polar degrees. See Theorem 2.13. We now simply take the sum of the alternating 
sums in (4.9) for i = 0, 1, ..., m. □ 


As an application, we now compute the generic ED degree of the Veronese variety. 


Example 4.25 Let n = $\binom{m+d}{d} - 1$ and let X ⊂ Pⁿ be the d-th Veronese embedding of Pᵐ. 
The generic ED degree of X from Definition 2.8 satisfies 


$$ {\rm EDdegree}_{\rm gen}(X) \,=\, \frac{(2d-1)^{m+1} - (d-1)^{m+1}}{d}. \qquad (4.12) $$

This is precisely the ED degree of the image of X under a generic linear change of 
coordinates in Pⁿ. Corollary 4.24 is a formula for the generic ED degree. 

We shall now derive (4.12) from Corollary 4.24. Our argument is taken from the 
proof of [60, Proposition 7.10]. The i-th Chern class of the underlying projective space Pᵐ 
is cᵢ(Pᵐ) = $\binom{m+1}{i} h^i$, where h is the hyperplane class in Pᵐ. Since X is the image under 
the d-th Veronese embedding of Pᵐ, its hyperplane class is dh. The degree of its Chern class 
cᵢ(X) is the integral of (dh)^{m−i} cᵢ(Pᵐ) over Pᵐ. Since h^{m−i} hⁱ = hᵐ = 1, the numerical 
value of this formal integral (in the Chow ring of Pᵐ) equals deg(cᵢ(X)) = $\binom{m+1}{i} d^{m-i}$. 
We plug this into Corollary 4.24. After some algebraic manipulations, we arrive at (4.12). 
For m = 1 the formula (4.12) evaluates to d + (2d − 2) = μ₀(X) + μ₁(X), as desired. ⋄ 
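
The algebraic manipulations in this derivation can be checked numerically. The following Python sketch evaluates the alternating sum of Corollary 4.24 on the Chern degrees of the Veronese variety and compares the result with the closed form (4.12) for small m and d. The function names are hypothetical.

```python
from math import comb


def veronese_chern_degrees(m, d):
    """deg c_i(X) = binom(m+1, i) * d^(m-i) for the d-th Veronese embedding of P^m."""
    return [comb(m + 1, i) * d ** (m - i) for i in range(m + 1)]


def ed_degree_from_chern(chern_degrees):
    """The alternating sum in Corollary 4.24."""
    m = len(chern_degrees) - 1
    return sum((-1) ** i * (2 ** (m + 1 - i) - 1) * c
               for i, c in enumerate(chern_degrees))


def veronese_generic_ed_degree(m, d):
    """The closed form (4.12); the division is exact."""
    return ((2 * d - 1) ** (m + 1) - (d - 1) ** (m + 1)) // d


for m in range(1, 4):
    for d in range(1, 5):
        lhs = ed_degree_from_chern(veronese_chern_degrees(m, d))
        rhs = veronese_generic_ed_degree(m, d)
        print(m, d, lhs, rhs)
```

The two columns agree in every row. The exactness of the integer division follows since (2d − 1) − (d − 1) = d divides the difference of the two powers.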


In conclusion, Chern classes provide a conceptual framework for the degrees of 
optimization problems in metric algebraic geometry. This chapter offered a geometric 
introduction. Example 4.25 is a nice illustration of how Chern classes are used in practice. 


Chapter 5 
Wasserstein Distance 

A fundamental problem in metric algebraic geometry is distance minimization. We seek a 
point in a variety X in Rⁿ that is closest to a given data point u ∈ Rⁿ. Thus, we wish to 


minimize ‖x − u‖ subject to x ∈ X. (5.1) 


In Chapter 2 we studied this problem for the Euclidean distance on Rⁿ. We here examine 
(5.1) for the case when the distance is given by a polyhedral norm. We note that the 
minimum in (5.1) is always attained because X is non-empty and closed. Hence, there 
exists at least one optimal solution. If that solution is unique then we denote it by x*. In 
Sections 5.2 and 5.3 we focus on a special class of polyhedral norms that arise from optimal 
transport theory. The corresponding distance is known as Wasserstein distance. 


5.1 Polyhedral Norms 


A norm $\|\cdot\|$ on the real vector space $\mathbb{R}^n$ is a polyhedral norm if its unit ball is polyhedral:

\[ B = \{ x \in \mathbb{R}^n \mid \|x\| \le 1 \}. \]

More precisely, $B$ is a centrally symmetric convex polytope. Conversely, every centrally
symmetric convex polytope $B$ in $\mathbb{R}^n$ defines a polyhedral norm on $\mathbb{R}^n$. Using the unit ball,
we can paraphrase the optimization problem (5.1) as follows:

\[ \text{minimize } \lambda \ \text{ subject to } \lambda \ge 0 \text{ and } (u + \lambda B) \cap X \neq \emptyset. \tag{5.2} \]


Example 5.1 The unit ball $B \subset \mathbb{R}^n$ of the $\infty$-norm $\|x\|_\infty = \max_{1 \le i \le n} |x_i|$ is the regular
cube $[-1,+1]^n$. The unit ball of the dual norm $\|x\|_1 := |x_1| + \cdots + |x_n|$ is the convex
polytope $\mathrm{conv}\{\pm e_1, \pm e_2, \ldots, \pm e_n\} \subset \mathbb{R}^n$. The latter polytope is called the crosspolytope.
It generalizes the octahedron from $n = 3$ to $n \ge 4$. ⋄
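A polyhedral norm can be evaluated directly from the facet description of its unit ball. If $B = \{y \mid \langle a, y \rangle \le 1 \text{ for all } a \in A\}$ for a finite set $A$ of facet normals, then $\|x\| = \max_{a \in A} \langle a, x \rangle$. The sketch below illustrates this for the two norms of Example 5.1. The helper names are hypothetical and chosen only for this illustration.

```python
from itertools import product

def polyhedral_norm(x, facet_normals):
    # Unit ball B = {y : <a, y> <= 1 for all a in facet_normals}.
    # Then ||x|| is the largest value of <a, x>.
    return max(sum(ai * xi for ai, xi in zip(a, x)) for a in facet_normals)

def cube_facet_normals(n):
    # The cube [-1, 1]^n has the 2n facet inequalities  +-x_i <= 1.
    normals = []
    for i in range(n):
        for sign in (1, -1):
            normals.append(tuple(sign if j == i else 0 for j in range(n)))
    return normals

def crosspolytope_facet_normals(n):
    # The crosspolytope has the 2^n facet inequalities  <s, x> <= 1, s in {+-1}^n.
    return list(product((1, -1), repeat=n))

if __name__ == "__main__":
    x = (3, -4, 1)
    print(polyhedral_norm(x, cube_facet_normals(3)))           # infinity-norm
    print(polyhedral_norm(x, crosspolytope_facet_normals(3)))  # 1-norm
```

Note that the facet normals of the cube are the vertices of the crosspolytope and vice versa, which reflects the duality of the two norms.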
Polyhedral norms are very important in optimal transport theory, where one uses a Wasserstein
norm on the space of probability distributions. Polytopes arise for distributions on
finite state spaces. This will be our main application, to be developed later in this chapter.

We begin our discussion with a general polyhedral norm; that is, we allow the unit ball $B$
to be an arbitrary $n$-dimensional centrally symmetric polytope in $\mathbb{R}^n$. As before, we use the
Euclidean inner product $\langle \cdot, \cdot \rangle$. Recall that a subset $F$ of the polytope $B$ is called a face if
there exists a vector $v \in \mathbb{R}^n \setminus \{0\}$ such that

\[ F = \{ x \in B \mid \langle x, v \rangle \ge \langle y, v \rangle \text{ for all } y \in B \}. \]

One says that the face $F$ maximizes the linear functional $\ell(x) := \langle x, v \rangle$. The vector $v$ is
a normal vector of $F$. The boundary of $B$ consists of faces whose dimensions range from
$0$ to $n-1$. Faces of maximal dimension are called facets. The set of all faces, ordered
by inclusion, is a partially ordered set, called the face poset of $B$. For an introduction to
polytopes see Ziegler's book [182]. An important combinatorial invariant is the f-vector.


Definition 5.2 Let $B \subset \mathbb{R}^n$ be a polytope. The f-vector of $B$ is $f(B) = (f_0, f_1, \ldots, f_{n-1})$,
where $f_i$ denotes the number of $i$-dimensional faces of $B$, for $0 \le i \le n-1$.


The dual of the unit ball $B$ is also a centrally symmetric polytope, namely it is the set

\[ B^* = \{ v \in \mathbb{R}^n \mid \langle x, v \rangle \le 1 \text{ for all } x \in B \}. \]

The norm $\|\cdot\|_*$ defined by the dual polytope $B^*$ is dual to the norm $\|\cdot\|$ given by $B$. The
f-vector of $B^*$ is the reverse of the f-vector of $B$. More precisely, we have

\[ f_i(B^*) = f_{n-1-i}(B) \quad \text{for } i = 0, 1, \ldots, n-1. \]
Example 5.3 Fix the unit cube $B = [-1, 1]^n$. Its dual is the crosspolytope

\[ B^* = \mathrm{conv}\{\pm e_1, \pm e_2, \ldots, \pm e_n\} \subset \mathbb{R}^n. \]

Here $e_j$ is the $j$th standard basis vector. The number of $i$-dimensional faces of the cube is
$f_i(B) = \binom{n}{i} \cdot 2^{n-i}$. Consequently, its f-vector is $f(B) = (2^n, 2^{n-1} n, \ldots, 2n) \in \mathbb{R}^n$.

The 3-dimensional crosspolytope is the octahedron. The cube $B$ has 8 vertices, 12 edges
and 6 facets. By duality, the octahedron $B^*$ has 6 vertices, 12 edges, and 8 facets.
Their f-vectors are $f(B) = (8, 12, 6)$ and $f(B^*) = (6, 12, 8)$. These numbers govern the
combinatorial structure of the associated polyhedral norms $\|\cdot\|_\infty$ and $\|\cdot\|_1$ on $\mathbb{R}^3$. ⋄
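The reversal of f-vectors under duality is easy to confirm for the cube and the crosspolytope. The sketch below uses the cube formula $f_i = \binom{n}{i} 2^{n-i}$ from Example 5.3 together with the standard count $f_i = \binom{n}{i+1} 2^{i+1}$ for the crosspolytope, which is not stated in the text and is supplied here as a known fact. Function names are hypothetical.

```python
from math import comb

def f_vector_cube(n):
    # f_i = binom(n, i) * 2^(n - i): choose the i free coordinates, fix the others to +-1.
    return [comb(n, i) * 2 ** (n - i) for i in range(n)]

def f_vector_crosspolytope(n):
    # An i-face is spanned by i + 1 vertices +-e_j with pairwise distinct indices j.
    return [comb(n, i + 1) * 2 ** (i + 1) for i in range(n)]

if __name__ == "__main__":
    for n in range(2, 6):
        cube, cross = f_vector_cube(n), f_vector_crosspolytope(n)
        print(n, cube, cross, cross == cube[::-1])
```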


We now turn to the optimization problem given in (5.1) or (5.2). To derive the critical
equations, we shall use a combinatorial stratification of the problem by the face poset of
the polytope $B$. The next lemma associates a unique face $F$ of $B$ to the optimal point $x^*$.
We write $L_F$ for the linear span of the face $F$ in $\mathbb{R}^n$. If $F$ has dimension $j < n$, then
$L_F$ has dimension $j + 1$, because the origin lies in the interior of $B$. For instance, if $F$ is a
vertex, then $L_F$ is the line through $F$ and the origin. If $F$ is a facet, then $L_F = \mathbb{R}^n$.


Lemma 5.4 Suppose that $X$ is in general position. Given $u \in \mathbb{R}^n$, let $x^*$ be an optimal
solution in (5.2) and let $\lambda^*$ be the optimal value. Then $x^*$ is unique, and the point $\frac{1}{\lambda^*}(x^* - u)$
lies in the relative interior of a unique face $F$ of the polytope $B$. Let $\ell_F$ be a linear
functional whose maximum over $B$ is attained on $F$. Then, the optimal point $x^*$ in (5.1) can
be recovered as the unique solution of the optimization problem

\[ \text{Minimize } \ell_F(x) \ \text{ subject to } x \in (u + L_F) \cap X. \tag{5.3} \]


Proof We defined $\lambda^*$ to be the optimal value of (5.2). By construction, it is also the optimal
value of (5.3). The general position hypothesis ensures that the affine space $u + L_F$ intersects
the real variety $X$ transversally, and $x^*$ is a smooth point of that intersection. The genericity
assumption ensures that $(u + \lambda^* B) \cap X = \{x^*\}$ is a singleton. Hence, the optimal point $x^*$
is unique, and it is also the minimizer of the linear functional $\ell_F$ on the variety $(u + L_F) \cap X$.
By our hypothesis, this linear function is generic relative to the variety, so the number of
critical points is finite and the function values are distinct. □


Lemma 5.4 motivates the following strategy for the distance minimization problem (5.1):
For each of the finitely many faces $F$ of $B$, solve the linear program (5.3) over $X$, and
determine the distance from $u$ to $X$ from this finite amount of data. This splits into three
subtasks: Combinatorial Preprocessing, Numerical Optimization, and (optionally) Algebraic
Postprocessing. We give a high-level description of these steps in Algorithms 1-3.


Algorithm 1: Combinatorial Preprocessing

Output: A description of all faces $F$ of the unit ball $B$.
1 Depending on how $B$ is presented, apply appropriate tools from computational geometry to
  compute all its faces.
2 For each face $F$, fix a linear functional $\ell_F$ that is maximized on $F$, and let $L_F$ be the linear span of
  the vertices of $F$.
3 return the list of pairs $(\ell_F, L_F)$, one for each face of $B$.
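For the cube $B = [-1,1]^n$, Algorithm 1 can be carried out by hand, and the following sketch spells this out. It is an illustration for this one polytope only, not a general face enumeration routine. Each proper face corresponds to a nonzero vector $v \in \{-1,0,1\}^n$: the coordinates with $v_j \neq 0$ are fixed to $v_j$, the others are free. The functional $\ell_F = \langle v, \cdot \rangle$ is maximized on that face, and $L_F$ is spanned by $v$ together with the unit vectors $e_j$ of the free coordinates. The function name and the dictionary keys are hypothetical.

```python
from itertools import product

def cube_faces(n):
    """List all proper faces of the cube [-1, 1]^n as (normal, dim, span basis)."""
    faces = []
    for v in product((-1, 0, 1), repeat=n):
        free = [j for j in range(n) if v[j] == 0]
        if len(free) == n:
            continue  # v = 0 would describe the whole cube, not a proper face
        # L_F is spanned by the face center v and the directions of the free coordinates.
        basis = [v] + [tuple(1 if t == j else 0 for t in range(n)) for j in free]
        faces.append({"normal": v, "dim": len(free), "span_basis": basis})
    return faces

if __name__ == "__main__":
    faces = cube_faces(3)
    f_vector = [sum(1 for f in faces if f["dim"] == i) for i in range(3)]
    print(f_vector)  # number of faces of dimension 0, 1, 2
```

The length of each stored basis is $\dim(F) + 1$, in agreement with the dimension count for $L_F$ given before Lemma 5.4.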


The complexity of computing the distance to a model X in the polyhedral norm has two 
components, seen clearly in (5.4). One is the combinatorial complexity of the unit ball B, 
which is encoded in the f-vector $f(B)$. This complexity affects Algorithm 1. Admittedly,
we did not specify any details for the computation of the face lattice of B. Instead, we refer 
the reader to the vast literature on algorithms in polyhedral geometry. 

The second component in the complexity of our problem (5.1) is the algebraic complexity 
of solving the linear optimization problem (5.3). It governs Algorithm 2. For an illustration 
see Figure 5.1 and the discussion in Examples 5.7 and 5.8. 

We now determine the algebraic degree of the optimization problem (5.3) when $F$ is
a face of codimension $i$. To this end, we replace the affine variety $X \subset \mathbb{R}^n$ and the affine
space $L = u + L_F$ in $\mathbb{R}^n$ by their respective closures in complex projective space $\mathbb{P}^n$. We
retain the same symbols $X$ and $L$ for the respective projective varieties. The following
result formulates the algebraic degree of the linear program (5.3) in terms of the polar
degrees that we have introduced in Definition 2.18 and again in Chapter 4.


Algorithm 2: Numerical Optimization

Input: Model $X$ and a point $u \in \mathbb{R}^n$; complete output from Algorithm 1.
Output: The optimal solution $x^*$ of (5.1) along with its corresponding face $G$.
1 for each face $F$ of the unit ball $B$ do
2   Solve the optimization problem (5.3).
3   Store the optimal solution $x$ and a basis for the linear subspace $L_F$ of $\mathbb{R}^n$.
4 end
5 Among all candidate solutions, identify the solution $x^*$ whose distance to the given
  data point $u$ is smallest. Record its face $G$.
6 return the optimal solution $x^*$, its associated linear space $L_G$, and the face normal $\ell_G$.


Algorithm 3: Algebraic Postprocessing

Input: The optimal solution $(x^*, G)$ to (5.1) in the form found by Algorithm 2.
Output: The maximal ideal in the polynomial ring $\mathbb{Q}[x_1, \ldots, x_n]$ which has the zero $x^*$.
1 Use Lagrange multipliers to give polynomial equations that characterize the critical points of the
  linear function $\ell_G$ on the subvariety $(u + L_G) \cap X$ in the affine space $\mathbb{R}^n$.
2 Eliminate all variables representing Lagrange multipliers from the ideal in the previous step. This
  ideal lives in $\mathbb{Q}[x_1, \ldots, x_n]$.
3 if the ideal in step 2 is maximal then
4   Call the ideal $M$.
5 else
6   Remove extraneous primary components to get the maximal ideal $M$ of $x^*$.
7 end
8 Determine the degree of $x^*$, which is the dimension of $\mathbb{Q}[x_1, \ldots, x_n]/M$ over $\mathbb{Q}$.
9 return the generators for the maximal ideal $M$ along with the degree.
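To make the face-by-face strategy of Algorithm 2 concrete, here is a minimal sketch for a toy instance chosen for this illustration: $X$ is the unit circle in $\mathbb{R}^2$ and the norm is $\|\cdot\|_\infty$, so $B$ is the square. This instance is an assumption of the sketch and does not appear in the text. For a facet, $L_F = \mathbb{R}^2$ and the critical points of a coordinate function on the circle are $(\pm 1, 0)$ or $(0, \pm 1)$. For a vertex $w$, the line $u + \mathbb{R} w$ meets the circle in at most two points, found with the quadratic formula. Every candidate lies on $X$, so the candidate with the smallest $\infty$-distance to $u$ is optimal.

```python
from math import sqrt

def linf_distance_to_unit_circle(u):
    """Collect the critical points of (5.3) over all faces of the square (toy case)."""
    candidates = []
    # Facets of the square: L_F = R^2, critical points of x_1 and x_2 on the circle.
    candidates += [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0)]
    # Vertices w of the square: intersect the line u + t*w with the circle.
    for w in [(1, 1), (1, -1), (-1, 1), (-1, -1)]:
        a = w[0] ** 2 + w[1] ** 2
        b = 2 * (u[0] * w[0] + u[1] * w[1])
        c = u[0] ** 2 + u[1] ** 2 - 1
        disc = b * b - 4 * a * c
        if disc < 0:
            continue  # the line misses the circle over the reals
        for sign in (1, -1):
            t = (-b + sign * sqrt(disc)) / (2 * a)
            candidates.append((u[0] + t * w[0], u[1] + t * w[1]))

    def dist(x):
        return max(abs(x[0] - u[0]), abs(x[1] - u[1]))

    best = min(candidates, key=dist)
    return dist(best), best

if __name__ == "__main__":
    print(linf_distance_to_unit_circle((0.3, 0.2)))
    print(linf_distance_to_unit_circle((2.0, 0.0)))
```

In this toy case the number of complex critical points is $2 \cdot 4 + 2 \cdot 4 = 16$: each of the four vertices and four edges of the square contributes two. For a general variety, step 2 of Algorithm 2 would call a numerical solver instead of closed formulas.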


Theorem 5.5 Let $L \subset \mathbb{R}^n$ be a general affine-linear space of codimension $i - 1$ and $\ell$ be a
general linear form. The number of critical points of $\ell$ on $L \cap X$ is the polar degree $\delta_i(X)$.


Proof This result appears in [44, Theorem 5.1]. The number of critical points of a general
linear form on the intersection $L \cap X$ in $\mathbb{R}^n$ is the degree of the dual variety $(L \cap X)^\vee$. That
degree coincides with the polar degree $\delta_i(X)$. □


Theorem 5.5 offers a direct interpretation of each polar degree $\delta_i(X)$ in terms of optimization
on $X$. This interpretation can be used as a definition of polar degrees. Some readers
might prefer this over the definitions given in Chapter 4.


Corollary 5.6 If the variety $X$ is in general position, then the total number of critical points
of the optimization problem (5.1) arises from the f-vector of $B$ and the polar degrees of $X$.
That number equals

\[ \delta_n(X) \cdot f_0(B) + \delta_{n-1}(X) \cdot f_1(B) + \cdots + \delta_1(X) \cdot f_{n-1}(B). \tag{5.4} \]

Proof The total number of critical points is $\alpha_0 \cdot f_0(B) + \cdots + \alpha_{n-1} \cdot f_{n-1}(B)$, where $\alpha_j$ is
the number of critical points of (5.3) for a face of dimension $j$. Fix a face $F$ of dimension $j$.
The affine linear space $L := u + L_F$ has codimension $n - j - 1$ in $\mathbb{R}^n$. Since $X$ is assumed to
be in general position, $L$ is general relative to $X$ and there exists a general linear functional
$\ell$ that attains its maximum over $B$ at $F$. Therefore, we can apply Theorem 5.5, and we find
that the number of critical points of (5.3) is $\alpha_j = \delta_{n-j}(X)$. □


Fig. 5.1: The green sphere is the given variety $X$. The data point $u$ is white. We solve the problem (5.1) for
the norm $\|\cdot\|_\infty$. Balls in this norm are cubes. The contact point $x^*$ is marked with a cross. The optimal
face $F$ is (from left to right) a facet, vertex, or edge.


We have learned in Chapter 2 that the polar degrees $\delta_i(X)$ of a variety $X$ determine its
ED degree and hence the algebraic complexity of Euclidean distance minimization for $X$.
Corollary 5.6 highlights that this applies not just to the Euclidean distance, but also to the
analogous problem for polyhedral norms. The two extreme cases in Theorem 5.5 are $i = 1$
and $i = n$. Touching at a vertex ($i = n$) can only happen when $X$ is a hypersurface, and here
$\delta_n(X) = \mathrm{degree}(X)$. Touching at a facet ($i = 1$) can happen for varieties of any dimension,
as long as the dual variety $X^\vee$ is a hypersurface. Here, we have $\delta_1(X) = \mathrm{degree}(X^\vee)$.


Example 5.7 (Touching at a facet) Suppose that the face $F$ is a facet of the unit ball $B$.
Then $L_F = \mathbb{R}^n$, and $\ell_F$ is an outer normal vector to that facet, which is unique up to scaling.
Here, the optimization problem (5.3) asks for the minimum of $\ell_F$ over $X$. This situation
corresponds to the left diagram in Figure 5.1. ⋄


Example 5.8 (Touching at a vertex) Suppose $F$ is a vertex of the unit ball $B$. This case
arises when $X$ is a hypersurface. It corresponds to the middle diagram in Figure 5.1. Here,
the affine space $u + L_F$ is the line that connects $u$ and $x^*$. That line intersects $X$ in a finite
set of cardinality $\mathrm{degree}(X)$. The optimum $x^*$ is the real point in that finite set at which the
value of the linear form $\ell_F$ is minimal. ⋄


Next, we work out Lemma 5.4 and Theorem 5.5 when $X$ is a general surface in $\mathbb{R}^3$.


Example 5.9 Consider the problem in (5.1) and (5.2) for a general surface $X$ of degree $d$
in $\mathbb{R}^3$. The optimal face $F$ of the unit ball $B$ depends on the location of the data point $u$. The
algebraic degree of the solution $x^*$ is $\delta_3(X) = d$ if $\dim(F) = 0$, it is $\delta_2(X) = d(d-1)$
if $\dim(F) = 1$, and it is $\delta_1(X) = d(d-1)^2$ if $\dim(F) = 2$. Here $x^*$ is the unique point in
$(u + \lambda^* B) \cap X$, where $\lambda^*$ is the optimal value in (5.2).

Figure 5.1 visualizes these three cases for $d = 2$ and $\|\cdot\|_\infty$. The variety $X$ is the green
sphere, which is a surface of degree $d = 2$. The unit ball for the norm $\|\cdot\|_\infty$ is the cube
$B = [-1,1]^3$. The picture shows the smallest $\lambda^*$ such that $u + \lambda^* B$ touches the sphere $X$.
The cross marks the point of contact. This is the point $x^*$ in $X$ which is closest in $\infty$-norm
to the white point $u$ in the center of the cube. The point of contact is either on a facet, or
on an edge, or it is a vertex. The algebraic degree of $x^*$ is two in all three cases, i.e. we can
write the solution $x^*$ in terms of the data $u$ by using the quadratic formula. If the green
surface $X$ were a cubic surface ($d = 3$) then these polar degrees would be 3, 6 and 12. ⋄
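The count (5.4) combines the two ingredients mechanically. The following sketch evaluates it for a general surface of degree $d$ in $\mathbb{R}^3$, using the polar degrees from Example 5.9 and the f-vectors from Example 5.3. The function names are hypothetical.

```python
def polar_degrees_general_surface(d):
    # (delta_1, delta_2, delta_3) for a general surface of degree d, as in Example 5.9.
    return (d * (d - 1) ** 2, d * (d - 1), d)

def total_critical_points(delta, f):
    # Formula (5.4): delta_n * f_0 + delta_{n-1} * f_1 + ... + delta_1 * f_{n-1},
    # where delta = (delta_1, ..., delta_n) and f = (f_0, ..., f_{n-1}).
    n = len(f)
    return sum(delta[n - 1 - j] * f[j] for j in range(n))

if __name__ == "__main__":
    cube, octahedron = (8, 12, 6), (6, 12, 8)
    for d in (2, 3):
        delta = polar_degrees_general_surface(d)
        print(d, total_critical_points(delta, cube), total_critical_points(delta, octahedron))
```

For the sphere ($d = 2$) and the cube, every face contributes two critical points, which gives $2 \cdot (8 + 12 + 6) = 52$.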


Our geometric discussion can be translated into piecewise-algebraic formulas for the
optimal point $x^*$ and the optimal value $\lambda^*$. This rests on the algebraic postprocessing in
Algorithm 3. It is carried out explicitly for a statistical scenario in Theorem 5.14. In that
scenario, $X$ is also a quadratic surface in 3-space, just like the green sphere in Figure 5.1.


5.2 Optimal Transport and Independence Models 


We now come to the title of this chapter, namely the Wasserstein distance to a variety $X$.
For us, $X$ will be an independence model in a probability simplex, given by matrices or
tensors of low rank (see Chapter 12). We measure distances using Wasserstein metrics on
that simplex. This is a class of polyhedral norms of importance in optimal transport theory.

We now present the relevant definitions. A probability distribution on the finite set
$[n] = \{1, 2, \ldots, n\}$ is a point $\nu$ in the $(n-1)$-dimensional probability simplex

\[ \Delta_{n-1} := \{ (\nu_1, \ldots, \nu_n) \in \mathbb{R}^n_{\ge 0} \mid \nu_1 + \cdots + \nu_n = 1 \}. \]

We shall turn this simplex into a metric space, by means of the Wasserstein distance. To
define this notion, we first turn the finite state space $[n]$ into a finite metric space. The
metric on $[n]$ is given by fixing a symmetric $n \times n$ matrix $d = (d_{ij})$ with nonnegative
entries. These entries satisfy $d_{ii} = 0$ and $d_{ik} \le d_{ij} + d_{jk}$ for all $i, j, k$.


Definition 5.10 Given two probability distributions $\mu, \nu \in \Delta_{n-1}$, we consider the following
linear programming problem, where $z = (z_1, \ldots, z_n)$ is the decision variable:

\[ \text{Maximize } \sum_{i=1}^{n} (\mu_i - \nu_i)\, z_i \ \text{ subject to } |z_i - z_j| \le d_{ij} \text{ for all } 1 \le i < j \le n. \tag{5.5} \]

The optimal value of (5.5), denoted $W_d(\mu, \nu)$, is the Wasserstein distance between $\mu$ and $\nu$.


The optimal solution $z^*$ to problem (5.5) is known as the optimal discriminator for the
two probability distributions $\mu$ and $\nu$. It satisfies $W_d(\mu, \nu) = \langle \mu - \nu, z^* \rangle$, where $\langle \cdot, \cdot \rangle$ is
the Euclidean inner product on $\mathbb{R}^n$. The coordinates $z_i^*$ of the optimal discriminator $z^*$ are
the weights on the state space $[n]$ that best tell $\mu$ and $\nu$ apart. The linear program (5.5) is
the Kantorovich dual of the optimal transport problem. The feasible region of the linear
program (5.5) is unbounded because it is invariant under translation by

\[ \mathbf{1} = (1, 1, \ldots, 1) \in \mathbb{R}^n. \]

It is compact after taking the quotient modulo the line $\mathbb{R}\mathbf{1}$. This motivates the following.

Definition 5.11 The Lipschitz polytope of the finite metric space $([n], d)$ equals

\[ P_d = \{ z \in \mathbb{R}^n / \mathbb{R}\mathbf{1} \mid |z_i - z_j| \le d_{ij} \text{ for all } 1 \le i < j \le n \}. \tag{5.6} \]


Note that $\dim(P_d) \le n - 1$. Lipschitz polytopes arise prominently in tropical geometry
(see [133, Chapter 7]), where they are called polytropes. A polytrope is a subset of $\mathbb{R}^n / \mathbb{R}\mathbf{1}$
that is convex both classically and tropically. The dual polytope $P_d^*$ lies in the hyperplane
perpendicular to the line $\mathbb{R}\mathbf{1}$. We call $P_d^*$ a root polytope because its vertices are, up to
scaling, the elements $e_i - e_j$ in the root system of Lie type $A_{n-1}$. The root polytope equals

\[ P_d^* = \{ x \in \mathbb{R}^n \mid \max_{z \in P_d} \langle x, z \rangle \le 1 \}
        = \mathrm{conv}\Big\{ \tfrac{1}{d_{ij}} (e_i - e_j) \;\Big|\; 1 \le i, j \le n \Big\}. \tag{5.7} \]

This is a centrally symmetric polytope since the metric $d$ satisfies $d_{ij} = d_{ji}$.


Proposition 5.12 The Wasserstein metric $W_d$ on the probability simplex $\Delta_{n-1}$ is equal to
the polyhedral norm whose unit ball is the root polytope $P_d^*$. Hence, all results in Section 5.1
apply to Wasserstein metrics.


Proof Fix the polyhedral norm with unit ball $P_d^*$. The distance between $\mu$ and $\nu$ in this
norm is the smallest real number $\lambda$ such that $\mu \in \nu + \lambda P_d^*$, or, equivalently, $\frac{1}{\lambda}(\mu - \nu) \in P_d^*$.
By definition of the dual polytope, this minimal $\lambda$ is the maximum inner product $\langle \mu - \nu, z \rangle$
over all points $z$ in the dual $(P_d^*)^*$ of the unit ball. But this dual is precisely the Lipschitz
polytope because the biduality $(P_d^*)^* = P_d$ holds for polytopes. Hence the distance between
$\mu$ and $\nu$ is equal to $W_d(\mu, \nu)$, which is the optimal value in (5.5). □


Example 5.13 Let $n = 4$ and fix the finite metric from the graph distance on the 4-cycle

\[ d = \begin{pmatrix} 0 & 1 & 1 & 2 \\ 1 & 0 & 2 & 1 \\ 1 & 2 & 0 & 1 \\ 2 & 1 & 1 & 0 \end{pmatrix}. \tag{5.8} \]

The induced metric on the tetrahedron $\Delta_3$ is given by the Lipschitz polytope

\[ P_d = \{ (x_1, x_2, x_3, x_4) \in \mathbb{R}^4 / \mathbb{R}\mathbf{1} \mid |x_1 - x_2| \le 1,\ |x_1 - x_3| \le 1,\ |x_2 - x_4| \le 1,\ |x_3 - x_4| \le 1 \} \]
\[ \phantom{P_d} = \mathrm{conv}\{ \pm(1, 0, 0, -1),\ \pm(\tfrac12, -\tfrac12, -\tfrac12, \tfrac12),\ \pm(0, 1, -1, 0) \}. \]

Note that this 3-dimensional polytope is an octahedron. Therefore, its dual is a cube:

\[ P_d^* = \{ (y_1, y_2, y_3, y_4) \in (\mathbb{R}\mathbf{1})^\perp \mid |y_1 - y_4| \le 1,\ |y_2 - y_3| \le 1,\ |y_2 + y_3| \le 1 \} \]
\[ \phantom{P_d^*} = \mathrm{conv}\{ \pm(1, -1, 0, 0),\ \pm(1, 0, -1, 0),\ \pm(0, 1, 0, -1),\ \pm(0, 0, 1, -1) \}. \]

This is the unit ball for the Wasserstein metric on the tetrahedron $\Delta_3$ that is induced by $d$.
Figure 5.1 illustrates the distance from a point to a surface with respect to this metric. ⋄
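For the 4-cycle metric, the linear program (5.5) can be solved by inspecting the vertices of the Lipschitz polytope, because a linear functional attains its maximum over a polytope at a vertex. The sketch below does this with exact rational arithmetic. It is tied to the vertex list of Example 5.13 and is not a general linear programming solver. The names are hypothetical.

```python
from fractions import Fraction

# The 4-cycle metric (5.8).
D = [[0, 1, 1, 2],
     [1, 0, 2, 1],
     [1, 2, 0, 1],
     [2, 1, 1, 0]]

HALF = Fraction(1, 2)
# Vertices of the Lipschitz polytope P_d from Example 5.13, listed up to sign.
LIPSCHITZ_VERTICES = [(1, 0, 0, -1), (HALF, -HALF, -HALF, HALF), (0, 1, -1, 0)]

def wasserstein(mu, nu):
    # W_d(mu, nu) = max over the vertices z of P_d of <mu - nu, z>.
    # Taking absolute values accounts for the vertices -z.
    y = [a - b for a, b in zip(mu, nu)]
    return max(abs(sum(yi * zi for yi, zi in zip(y, z))) for z in LIPSCHITZ_VERTICES)

if __name__ == "__main__":
    unit = [tuple(1 if t == i else 0 for t in range(4)) for i in range(4)]
    for i in range(4):
        print([wasserstein(unit[i], unit[j]) for j in range(4)])
```

The printed matrix reproduces $d$: the Wasserstein distance between two point masses is the distance between the underlying states.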


We wish to compute the Wasserstein distance from a given probability distribution $\mu$ to
a fixed discrete statistical model. We now denote this model by

\[ \mathcal{M} \subset \Delta_{n-1}. \]

This is the scenario studied in [43, 44]. The remainder of this chapter is based on the
presentation in these two articles. As is customary in algebraic statistics [167], we assume
that $\mathcal{M}$ is defined by polynomials in the unknowns $\nu = (\nu_1, \ldots, \nu_n)$. Thus

\[ \mathcal{M} = X \cap \Delta_{n-1} \]

for some algebraic variety $X$ in $\mathbb{R}^n$. Our task is to solve the following optimization problem:

\[ W_d(\mu, \mathcal{M}) \;:=\; \min_{\nu \in \mathcal{M}} W_d(\mu, \nu) \;=\; \min_{\nu \in \mathcal{M}} \max_{z \in P_d} \langle \mu - \nu, z \rangle. \tag{5.9} \]

Computing this quantity means solving a non-convex optimization problem. Our aim is to
study this problem and propose solution strategies, using methods from geometry, algebra
and combinatorics. We summarized these in Algorithms 1-3. Similar strategies for the
Euclidean metric and the Kullback–Leibler divergence are found in Chapters 2 and 11.
We conclude this section with a detailed case study for the tetrahedron $\Delta_3$ whose points
are joint probability distributions of two binary random variables. The 2-bit independence
model $\mathcal{M} \subset \Delta_3$ consists of all nonnegative $2 \times 2$ matrices of rank one whose entries sum
to one. This model has the parametric representation

\[ \begin{pmatrix} \nu_1 & \nu_2 \\ \nu_3 & \nu_4 \end{pmatrix}
 = \begin{pmatrix} pq & p(1-q) \\ (1-p)q & (1-p)(1-q) \end{pmatrix},
 \qquad (p, q) \in [0,1]^2. \tag{5.10} \]

Thus, $\mathcal{M}$ is the surface in the tetrahedron $\Delta_3$ defined by the equation $\nu_1 \nu_4 = \nu_2 \nu_3$. The
next theorem gives the optimal value function and the solution function for this independence
model. We use the Wasserstein metric $W_d$ from Example 5.13. For the proof of
Theorem 5.14 and a simpler example we refer to the paper [43] by Çelik, Jamneshan,
Montúfar, Sturmfels and Venturello.

Theorem 5.14 involves a distinction into eight cases. This division of $\Delta_3$ is shown in
Figure 5.2. Each of the last four cases breaks into two subcases, since the numerator in the
formulas is the absolute value of $\mu_1 \mu_4 - \mu_2 \mu_3$. The sign of this $2 \times 2$ determinant matters for
the pieces of our piecewise algebraic function. Thus, the tetrahedron $\Delta_3$ is divided into 12
regions. On each region the optimal value map $\mu \mapsto W_d(\mu, \mathcal{M})$ is one algebraic function.
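Theorem 5.14 The Wasserstein distance from a distribution $\mu \in \Delta_3$ to the model $\mathcal{M}$ equals

\[ W_d(\mu, \mathcal{M}) = \begin{cases}
2\sqrt{\mu_1}\,(1 - \sqrt{\mu_1}) - \mu_2 - \mu_3, & \text{if } \mu_1 \ge \mu_4,\ \sqrt{\mu_1} \ge \mu_1 + \mu_2,\ \sqrt{\mu_1} \ge \mu_1 + \mu_3, \\
2\sqrt{\mu_2}\,(1 - \sqrt{\mu_2}) - \mu_1 - \mu_4, & \text{if } \mu_2 \ge \mu_3,\ \sqrt{\mu_2} \ge \mu_1 + \mu_2,\ \sqrt{\mu_2} \ge \mu_2 + \mu_4, \\
2\sqrt{\mu_3}\,(1 - \sqrt{\mu_3}) - \mu_1 - \mu_4, & \text{if } \mu_3 \ge \mu_2,\ \sqrt{\mu_3} \ge \mu_1 + \mu_3,\ \sqrt{\mu_3} \ge \mu_3 + \mu_4, \\
2\sqrt{\mu_4}\,(1 - \sqrt{\mu_4}) - \mu_2 - \mu_3, & \text{if } \mu_4 \ge \mu_1,\ \sqrt{\mu_4} \ge \mu_2 + \mu_4,\ \sqrt{\mu_4} \ge \mu_3 + \mu_4, \\
|\mu_1\mu_4 - \mu_2\mu_3| / (\mu_1 + \mu_2), & \text{if } \mu_1 \ge \mu_4,\ \mu_2 \ge \mu_3,\ \mu_1 + \mu_2 \ge \sqrt{\mu_1},\ \mu_1 + \mu_2 \ge \sqrt{\mu_2}, \\
|\mu_1\mu_4 - \mu_2\mu_3| / (\mu_1 + \mu_3), & \text{if } \mu_1 \ge \mu_4,\ \mu_3 \ge \mu_2,\ \mu_1 + \mu_3 \ge \sqrt{\mu_1},\ \mu_1 + \mu_3 \ge \sqrt{\mu_3}, \\
|\mu_1\mu_4 - \mu_2\mu_3| / (\mu_2 + \mu_4), & \text{if } \mu_4 \ge \mu_1,\ \mu_2 \ge \mu_3,\ \mu_2 + \mu_4 \ge \sqrt{\mu_4},\ \mu_2 + \mu_4 \ge \sqrt{\mu_2}, \\
|\mu_1\mu_4 - \mu_2\mu_3| / (\mu_3 + \mu_4), & \text{if } \mu_4 \ge \mu_1,\ \mu_3 \ge \mu_2,\ \mu_3 + \mu_4 \ge \sqrt{\mu_4},\ \mu_3 + \mu_4 \ge \sqrt{\mu_3}.
\end{cases} \]

The solution function $\Delta_3 \to \mathcal{M}$, $\mu \mapsto \nu^*(\mu)$ is given (with the same case distinction) by

\[ \nu^*(\mu) = \begin{cases}
\big( \mu_1,\ \sqrt{\mu_1} - \mu_1,\ \sqrt{\mu_1} - \mu_1,\ -2\sqrt{\mu_1} + \mu_1 + 1 \big), \\
\big( \sqrt{\mu_2} - \mu_2,\ \mu_2,\ -2\sqrt{\mu_2} + \mu_2 + 1,\ \sqrt{\mu_2} - \mu_2 \big), \\
\big( \sqrt{\mu_3} - \mu_3,\ -2\sqrt{\mu_3} + \mu_3 + 1,\ \mu_3,\ \sqrt{\mu_3} - \mu_3 \big), \\
\big( -2\sqrt{\mu_4} + \mu_4 + 1,\ \sqrt{\mu_4} - \mu_4,\ \sqrt{\mu_4} - \mu_4,\ \mu_4 \big), \\
\big( \mu_1,\ \mu_2,\ \mu_1(\mu_3 + \mu_4)/(\mu_1 + \mu_2),\ \mu_2(\mu_3 + \mu_4)/(\mu_1 + \mu_2) \big), \\
\big( \mu_1,\ \mu_1(\mu_2 + \mu_4)/(\mu_1 + \mu_3),\ \mu_3,\ \mu_3(\mu_2 + \mu_4)/(\mu_1 + \mu_3) \big), \\
\big( \mu_2(\mu_1 + \mu_3)/(\mu_2 + \mu_4),\ \mu_2,\ \mu_4(\mu_1 + \mu_3)/(\mu_2 + \mu_4),\ \mu_4 \big), \\
\big( \mu_3(\mu_1 + \mu_2)/(\mu_3 + \mu_4),\ \mu_4(\mu_1 + \mu_2)/(\mu_3 + \mu_4),\ \mu_3,\ \mu_4 \big).
\end{cases} \]

The boundaries separating the various cases are given by the surfaces

\[ \{ \mu \in \Delta_3 \mid \mu_1 - \mu_4 = 0,\ \mu_1 + \mu_2 \ge \sqrt{\mu_1},\ \mu_1 + \mu_3 \ge \sqrt{\mu_1} \} \]
\[ \text{and} \quad \{ \mu \in \Delta_3 \mid \mu_2 - \mu_3 = 0,\ \mu_1 + \mu_2 \ge \sqrt{\mu_2},\ \mu_2 + \mu_4 \ge \sqrt{\mu_2} \}. \]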
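The piecewise formulas of Theorem 5.14 can be tested numerically. The sketch below implements the eight cases, evaluates the Wasserstein norm through the facet description of the cube $P_d^*$ from Example 5.13, and compares with a brute-force search over the parametrization (5.10). It is a sanity check under these assumptions, not a proof, and the function names are hypothetical.

```python
from math import sqrt

def wasserstein_norm(y):
    # Norm with unit ball P_d^* (Example 5.13); y must have coordinate sum zero.
    y1, y2, y3, y4 = y
    return max(abs(y1 - y4), abs(y2 - y3), abs(y2 + y3))

def theorem_5_14(mu):
    """Return (W_d(mu, M), nu*) following the eight cases of Theorem 5.14."""
    m1, m2, m3, m4 = mu
    r1, r2, r3, r4 = (sqrt(t) for t in mu)
    det = abs(m1 * m4 - m2 * m3)
    if m1 >= m4 and r1 >= m1 + m2 and r1 >= m1 + m3:
        return 2 * r1 * (1 - r1) - m2 - m3, (m1, r1 - m1, r1 - m1, 1 + m1 - 2 * r1)
    if m2 >= m3 and r2 >= m1 + m2 and r2 >= m2 + m4:
        return 2 * r2 * (1 - r2) - m1 - m4, (r2 - m2, m2, 1 + m2 - 2 * r2, r2 - m2)
    if m3 >= m2 and r3 >= m1 + m3 and r3 >= m3 + m4:
        return 2 * r3 * (1 - r3) - m1 - m4, (r3 - m3, 1 + m3 - 2 * r3, m3, r3 - m3)
    if m4 >= m1 and r4 >= m2 + m4 and r4 >= m3 + m4:
        return 2 * r4 * (1 - r4) - m2 - m3, (1 + m4 - 2 * r4, r4 - m4, r4 - m4, m4)
    if m1 >= m4 and m2 >= m3 and m1 + m2 >= r1 and m1 + m2 >= r2:
        s = m1 + m2
        return det / s, (m1, m2, m1 * (m3 + m4) / s, m2 * (m3 + m4) / s)
    if m1 >= m4 and m3 >= m2 and m1 + m3 >= r1 and m1 + m3 >= r3:
        s = m1 + m3
        return det / s, (m1, m1 * (m2 + m4) / s, m3, m3 * (m2 + m4) / s)
    if m4 >= m1 and m2 >= m3 and m2 + m4 >= r4 and m2 + m4 >= r2:
        s = m2 + m4
        return det / s, (m2 * (m1 + m3) / s, m2, m4 * (m1 + m3) / s, m4)
    if m4 >= m1 and m3 >= m2 and m3 + m4 >= r4 and m3 + m4 >= r3:
        s = m3 + m4
        return det / s, (m3 * (m1 + m2) / s, m4 * (m1 + m2) / s, m3, m4)
    raise ValueError("no case applies; check that mu lies in the simplex")

def brute_force_distance(mu, steps=400):
    # Grid search over the parametrization (5.10); gives an upper bound on W_d(mu, M).
    best = float("inf")
    for a in range(steps + 1):
        p = a / steps
        for b in range(steps + 1):
            q = b / steps
            nu = (p * q, p * (1 - q), (1 - p) * q, (1 - p) * (1 - q))
            best = min(best, wasserstein_norm([m - v for m, v in zip(mu, nu)]))
    return best

if __name__ == "__main__":
    for mu in [(0.4, 0.3, 0.2, 0.1), (0.5, 0.1, 0.1, 0.3)]:
        value, nu_star = theorem_5_14(mu)
        print(mu, value, nu_star, brute_force_distance(mu))
```

The grid search can only overestimate the distance, so the value from the theorem should never exceed it, and the two should agree up to the grid resolution.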
At the end of the previous section, we studied the Wasserstein distance from a probability
distribution $\mu$ in the three-dimensional probability simplex to the variety of $2 \times 2$ matrices
of rank one. We now turn to the general case. Let $\mathcal{M}$ be an arbitrary smooth variety in
$\Delta_{n-1} \subset \mathbb{R}^n$. For the moment, we do not specify a statistical model $\mathcal{M}$; later in this section,
it will be an independence model. Furthermore, let $d = (d_{ij}) \in \mathbb{R}^{n \times n}$ induce a metric on
the finite state space $[n]$. Here are three examples of metrics $([n], d)$:

• The discrete metric on any finite set $[n]$ where $d_{ij} = 1$ for distinct $i, j$.

• The $L_0$-metric on the Cartesian product $[m_1] \times \cdots \times [m_k]$ where $d_{ij} = \#\{ l \mid i_l \neq j_l \}$.
  Here $i = (i_1, \ldots, i_k)$ and $j = (j_1, \ldots, j_k)$ are elements in that Cartesian product.

• The $L_1$-metric on the Cartesian product $[m_1] \times \cdots \times [m_k]$ where $d_{ij} = \sum_{l=1}^{k} |i_l - j_l|$.
Fig. 5.2: The optimal value function of Theorem 5.14 subdivides the tetrahedron (left). The red surface
consists of four pieces that, together with the blue surface, separate the eight cases in Theorem 5.14. Four
convex regions are enclosed between the red surfaces and the edges they meet. These regions represent the
first four cases in Theorem 5.14. The remaining four regions are each bounded by two red and two blue
pieces, and correspond to the last four cases. Each of these four regions is further split in two by the model.
We do not depict this in our visualization. The two sides are determined by the sign of the determinant
$\mu_1\mu_4 - \mu_2\mu_3$. The two blue surfaces (right) specify the points $\mu \in \Delta_3$ with more than one optimal solution.


For the last two metrics, the number of states of the relevant independence models is
$n = m_1 \cdots m_k$. All three metrics above are graph metrics. This means that there exists an
undirected simple graph $G$ with vertex set $[n]$ such that $d_{ij}$ is the length of the shortest path
from $i$ to $j$ in $G$. The corresponding Wasserstein balls are called symmetric edge polytopes.
The combinatorics of these polytopes is investigated in [44, Section 4].
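The claim that these are graph metrics can be verified by brute force on small state spaces. In the sketch below, the graph $G$ joins two states exactly when their distance is 1, and shortest path lengths are computed with the Floyd–Warshall algorithm. States are indexed from 0 rather than 1, which does not affect the distances. The function names are hypothetical.

```python
from itertools import product

def l0(i, j):
    # L0-metric: number of positions in which the two states differ.
    return sum(a != b for a, b in zip(i, j))

def l1(i, j):
    # L1-metric: sum of coordinatewise absolute differences.
    return sum(abs(a - b) for a, b in zip(i, j))

def discrete(i, j):
    return 0 if i == j else 1

def is_graph_metric(states, dist):
    """Check that dist equals the shortest-path metric of the graph of distance-1 pairs."""
    n = len(states)
    inf = float("inf")
    g = [[0 if a == b else (1 if dist(states[a], states[b]) == 1 else inf)
          for b in range(n)] for a in range(n)]
    for k in range(n):  # Floyd-Warshall
        for a in range(n):
            for b in range(n):
                if g[a][k] + g[k][b] < g[a][b]:
                    g[a][b] = g[a][k] + g[k][b]
    return all(g[a][b] == dist(states[a], states[b])
               for a in range(n) for b in range(n))

if __name__ == "__main__":
    cube = list(product(range(2), repeat=3))
    grid = list(product(range(3), range(3)))
    print(is_graph_metric(cube, l0), is_graph_metric(grid, l1),
          is_graph_metric([(s,) for s in range(7)], discrete))
```

On binary states the $L_0$- and $L_1$-metrics coincide, which is consistent with the entry "$L_0 = L_1$" in Table 5.1.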


For any $\mu \in \Delta_{n-1}$, we now seek the Wasserstein distance $W_d(\mu, \mathcal{M})$ to the model
$\mathcal{M}$. Recall from Proposition 5.12 that the unit ball of the Wasserstein metric is the root
polytope $P_d^* = \mathrm{conv}\{ \frac{1}{d_{ij}}(e_i - e_j) \mid 1 \le i, j \le n \}$. As before, for computing $W_d(\mu, \mathcal{M})$
we iterate through the faces of the unit ball $B = P_d^*$, and we solve the optimization problem
in Lemma 5.4. That is, for a fixed face $F$ of the polytope $B$ we solve:

\[ \text{Minimize } \ell_F = \ell_F(\nu) \ \text{ subject to } \nu \in (\mu + L_F) \cap \mathcal{M}, \tag{5.11} \]

where $\ell_F$ is any linear functional on $\mathbb{R}^n$ that attains its maximum over $B$ at $F$.
Let $\mathcal{F}$ be the set of pairs $(i, j)$ such that the point $\frac{1}{d_{ij}}(e_i - e_j)$ is a vertex of $B$ and it lies
in $F$. The linear space spanned by the face $F$ is

\[ L_F = \Big\{ \sum_{(i,j) \in \mathcal{F}} \lambda_{ij} (e_i - e_j) \;\Big|\; \lambda_{ij} \in \mathbb{R} \Big\}. \]

With this notation, the decision variables for (5.11) are the multipliers $\lambda_{ij}$ for $(i, j) \in \mathcal{F}$.
The algebraic complexity of this problem is given by the polar degree (Theorem 5.5).
The combinatorial complexity is governed by the facial structure of the Wasserstein ball
$B = P_d^*$. They are combined in Corollary 5.6.
We now work this out for the case when $\mathcal{M} \subset \Delta_{n-1}$ is an independence model for
discrete random variables, given by tensors of rank one. We denote by $(m)_r$ a multinomial
distribution with $m$ possible outcomes and $r$ trials, which can be interpreted as an unordered
set of $r$ identically distributed random variables on $[m] = \{1, 2, \ldots, m\}$. The subscript $r$ is
omitted if $r = 1$. For integers $m_1, \ldots, m_k$ and $r_1, \ldots, r_k$ we consider the model $\mathcal{M}$ whose
elements are given by $k$ independent multinomial distributions $(m_1)_{r_1}, \ldots, (m_k)_{r_k}$. We
denote this independence model by

\[ \mathcal{M} = \big( (m_1)_{r_1}, \ldots, (m_k)_{r_k} \big). \]

The number of states of the model $\mathcal{M}$ equals

\[ n = \prod_{i=1}^{k} \binom{m_i + r_i - 1}{r_i}. \]
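The number of states is straightforward to evaluate. The sketch below computes $n$ for the four models that serve as case studies in this chapter, together with the dimension $m_1 + \cdots + m_k - k$ of the model, a formula that is stated later in this section. The function names are hypothetical.

```python
from math import comb, prod

def num_states(ms, rs):
    # n = prod_i binom(m_i + r_i - 1, r_i)
    return prod(comb(m + r - 1, r) for m, r in zip(ms, rs))

def model_dimension(ms):
    # dim(M) = m_1 + ... + m_k - k
    return sum(m - 1 for m in ms)

if __name__ == "__main__":
    models = {
        "(2,2,2)": ((2, 2, 2), (1, 1, 1)),
        "(3,3)": ((3, 3), (1, 1)),
        "(2_6)": ((2,), (6,)),
        "(2_2,2)": ((2, 2), (2, 1)),
    }
    for name, (ms, rs) in models.items():
        print(name, num_states(ms, rs), model_dimension(ms))
```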


Example 5.15 The variety $\mathcal{M} = (2_2, 2)$ is the independence model for three binary random
variables where the first two are identically distributed. This model has $n = 6$ states. Note
that $\mathcal{M}$ is the image of the map from the square $[0, 1]^2$ into the simplex $\Delta_5$ given by

\[ (p, q) \mapsto \big( p^2 q,\ 2p(1-p)q,\ (1-p)^2 q,\ p^2(1-q),\ 2p(1-p)(1-q),\ (1-p)^2(1-q) \big). \]

Our parameterization lists the $n = 6$ states in the order 00, 10, 20, 01, 11, 21. These are the
vertices of the associated graph $G$, which is the product of a 3-chain and a 2-chain. ⋄


Example 5.16 The following four models are used for the case studies in [44, Section 6]:
the 3-bit model $(2, 2, 2)$ with the $L_0$-metric on $[2]^3$, the model $(3, 3)$ for two ternary
variables with the $L_1$-metric on $[3]^2$, the model $(2_6)$ for six identically distributed binary
variables with the discrete metric on $[7]$, and the model $(2_2, 2)$ in Example 5.15 with the
$L_1$-metric on $[3] \times [2]$. In Table 5.1, we report the f-vectors of the Wasserstein balls for
each of these models, thus hinting at combinatorial complexity. ⋄

M         | n | dim(M) | Metric d  | f-vector of the (n-1)-polytope $P_d^*$
(2,2,2)   | 8 | 3      | L0 = L1   | (24, 192, 652, 1062, 848, 306, 38)
(3,3)     | 9 | 4      | L1        | (24, 216, 960, 2298, 3048, 2172, 736, 82)
(2_6)     | 7 | 1      | discrete  | (42, 210, 490, 630, 434, 126)
(2_2,2)   | 6 | 2      | L1        | (14, 60, 102, 72, 18)

Table 5.1: f-vectors of the Wasserstein balls for the four models in Example 5.16.
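A quick consistency check on Table 5.1 uses Euler's relation, a standard fact about polytopes that is not stated in the text: the f-vector of a $D$-dimensional polytope satisfies $\sum_{i=0}^{D-1} (-1)^i f_i = 1 - (-1)^D$. The sketch below applies it to the four rows. The names are hypothetical.

```python
# f-vectors copied from Table 5.1; each polytope has dimension len(f) = n - 1.
F_VECTORS = {
    "(2,2,2)": (24, 192, 652, 1062, 848, 306, 38),
    "(3,3)": (24, 216, 960, 2298, 3048, 2172, 736, 82),
    "(2_6)": (42, 210, 490, 630, 434, 126),
    "(2_2,2)": (14, 60, 102, 72, 18),
}

def satisfies_euler_relation(f):
    # Euler's relation for a polytope of dimension D = len(f).
    alternating_sum = sum((-1) ** i * fi for i, fi in enumerate(f))
    return alternating_sum == 1 - (-1) ** len(f)

if __name__ == "__main__":
    for name, f in F_VECTORS.items():
        print(name, satisfies_euler_relation(f))
```

Passing this test does not prove the entries correct, but a transcription error in a single entry would be detected.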


As is customary in algebraic statistics, we replace the independence model introduced
above, which is a semialgebraic set inside a simplex, by its complex Zariski closure in a
projective space. This allows us to compute the algebraic degrees of our optimization problem.
Independence models correspond in algebraic geometry to Segre–Veronese varieties. The
Segre–Veronese variety $\mathcal{M} = ((m_1)_{r_1}, \ldots, (m_k)_{r_k})$ is the embedding of $\mathbb{P}^{m_1 - 1} \times \cdots \times \mathbb{P}^{m_k - 1}$
in the projective space of partially symmetric tensors, $\mathbb{P}(S^{r_1}(\mathbb{R}^{m_1}) \otimes \cdots \otimes S^{r_k}(\mathbb{R}^{m_k}))$.
That projective space equals $\mathbb{P}^{n-1}$ where $n = \prod_{i=1}^{k} \binom{m_i + r_i - 1}{r_i}$. By definition, the
Segre–Veronese variety $\mathcal{M}$ is the set of all tensors of rank one inside this projective space. The
dimension of $\mathcal{M}$ is $m := m_1 + \cdots + m_k - k$. Tensors of rank one are discussed in more
detail in Chapter 12. See Section 12.3 for their volumes.


Example 5.17 Let $k = 2$. The Segre–Veronese variety $\mathcal{M} = (2_2, 2)$ is an embedding
of $\mathbb{P}^1 \times \mathbb{P}^1$ into $\mathbb{P}^5$. This is a toric surface of degree four. Its points are rank one tensors of
format $2 \times 2 \times 2$ which are symmetric in the first two indices. This model appears in the
last row of Table 5.3. ⋄


The computation of the Chern classes of the Segre–Veronese variety $\mathcal{M}$ first appeared in
the doctoral dissertation of Luca Sodomaco [160, Chapter 5]. We obtain the polar degrees
using the formula that connects Chern classes with polar degrees in Theorem 4.20. We
here state the result of this computation, which was published in [113, Proposition 6.11].


Theorem 5.18 With the notation above, the polar degrees of the Segre–Veronese variety
are given by the following formula:

\[ \delta_{j+1}(\mathcal{M}) \;=\; \sum_{i=0}^{m-j} (-1)^i \binom{m+1-i}{j+1} (m-i)!
   \sum_{\alpha \in A} \prod_{l=1}^{k} \frac{\binom{m_l}{\alpha_l}\, r_l^{\,m_l - 1 - \alpha_l}}{(m_l - 1 - \alpha_l)!}, \tag{5.12} \]

where $A = \{ \alpha \in \mathbb{N}^k \mid \alpha_1 + \cdots + \alpha_k = i \text{ and } \alpha_l \le m_l - 1 \text{ for } l = 1, \ldots, k \}$.
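Formula (5.12) is easy to evaluate by machine. The sketch below is a direct transcription with exact rational arithmetic; it returns the list $(\delta_1, \ldots, \delta_{m+1})$, that is, the values for $j = 0, \ldots, m$. The function name is hypothetical. For the $k$-bit models and the models $(3,3)$, $(2_6)$, $(2_2, 2)$ the output can be compared with the polar degrees tabulated in this chapter.

```python
from fractions import Fraction
from itertools import product
from math import comb, factorial

def polar_degrees(ms, rs):
    """Evaluate (5.12) for M = ((m_1)_{r_1}, ..., (m_k)_{r_k}); returns [delta_1, ..., delta_{m+1}]."""
    k = len(ms)
    m = sum(ms) - k  # dimension of the Segre-Veronese variety

    def inner_sum(i):
        # Sum over alpha in A: alpha_1 + ... + alpha_k = i and alpha_l <= m_l - 1.
        total = Fraction(0)
        for alpha in product(*[range(ml) for ml in ms]):
            if sum(alpha) != i:
                continue
            term = Fraction(1)
            for ml, rl, al in zip(ms, rs, alpha):
                term *= Fraction(comb(ml, al) * rl ** (ml - 1 - al), factorial(ml - 1 - al))
            total += term
        return total

    sigma = [inner_sum(i) for i in range(m + 1)]
    degrees = []
    for j in range(m + 1):
        value = sum((-1) ** i * comb(m + 1 - i, j + 1) * factorial(m - i) * sigma[i]
                    for i in range(m - j + 1))
        assert value.denominator == 1  # polar degrees are integers
        degrees.append(int(value))
    return degrees

if __name__ == "__main__":
    print(polar_degrees((2, 2, 2), (1, 1, 1)))  # 3-bit model
    print(polar_degrees((3, 3), (1, 1)))        # rank one 3 x 3 matrices
    print(polar_degrees((2,), (6,)))            # rational normal curve of degree 6
    print(polar_degrees((2, 2), (2, 1)))        # Segre-Veronese surface (2_2, 2)
```

The first entry of each list is the degree of the dual hypersurface, and the last entry is the degree of the variety itself.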


We next examine this formula for various special cases, starting with the binary case.


Corollary 5.19 Let $\mathcal{M}$ be the $k$-bit independence model (this is the case $r_1 = \cdots = r_k = 1$
and $m_1 = \cdots = m_k = 2$). The formula (5.12) specializes to

\[ \delta_{j+1}(\mathcal{M}) \;=\; \sum_{i=0}^{k-j} (-1)^i \binom{k+1-i}{j+1} (k-i)!\; 2^i \binom{k}{i}. \tag{5.13} \]

In algebraic geometry language, our model $\mathcal{M}$ is the Segre embedding of $(\mathbb{P}^1)^k$ into
$\mathbb{P}^{2^k - 1}$. This is the toric variety associated with the $k$-dimensional unit cube. Its degree is
the normalized volume of that cube, which is $k!$. The dual variety $\mathcal{M}^\vee$ is a hypersurface of
degree $\delta_1$, known as the hyperdeterminant of format $2^k$. For instance, for $k = 3$, this
hypersurface in $\mathbb{P}^7$ is the $2 \times 2 \times 2$-hyperdeterminant. This hyperdeterminant is a homogeneous
polynomial of degree four in eight unknowns. We computed it in Example 3.16. The polar
degrees for the $k$-bit independence model in (5.13) are shown for $k \le 7$ in Table 5.2.

We now briefly discuss the independence models $(m_1, m_2)$ for two random variables.
These are the classical determinantal varieties of $m_1 \times m_2$ matrices of rank one. Here,
$n = m_1 m_2$ and $m = m_1 + m_2 - 2$.


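Corollary 5.20 The Segre variety $\mathcal{M} = \mathbb{P}^{m_1 - 1} \times \mathbb{P}^{m_2 - 1}$ in $\mathbb{P}^{n-1}$ has the polar degrees

\[ \delta_{j+1}(\mathcal{M}) \;=\; \sum_{i=0}^{m_1 + m_2 - 2 - j} (-1)^i \binom{m_1 + m_2 - 1 - i}{j+1} (m_1 + m_2 - 2 - i)!\; \sigma(i), \]

where $\sigma(i) = \sum_{s} \frac{\binom{m_1}{s} \binom{m_2}{i-s}}{(m_1 - 1 - s)!\,(m_2 - 1 - i + s)!}$ and the sum is over the set of integers $s$ such that
both $m_1 - 1 - s$ and $m_2 - 1 - i + s$ are nonnegative.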
j | k=2 | k=3 | k=4 | k=5 | k=6  | k=7
7 |     |     |     |     |      | 5040
6 |     |     |     |     | 720  | 30240
5 |     |     |     | 120 | 3600 | 80640
4 |     |     | 24  | 480 | 7920 | 124320
3 |     | 6   | 72  | 840 | 9840 | 120960
2 | 2   | 12  | 96  | 800 | 7440 | 75936
1 | 2   | 12  | 64  | 440 | 3408 | 30016
0 | 2   | 4   | 24  | 128 | 880  | 6816

Table 5.2: The table shows the polar degrees $\delta_{j+1}(\mathcal{M})$ of the $k$-bit independence model for $k \le 7$. The
indices $j$ with $\delta_{j+1}(\mathcal{M}) \neq 0$ range from $0$ to $k$. The bottom row, labeled 0, contains the degree of the
hyperdeterminant $\mathcal{M}^\vee$. On the antidiagonal ($j = k$) we find $\mathrm{degree}(\mathcal{M}) = k!$. The entries in the first
column ($k = 2$) correspond to the three scenarios in Figure 5.1, where each algebraic degree equals 2.


The polar degrees above serve as upper bounds for any particular Wasserstein distance
problem. For a fixed model $\mathcal{M}$, the equality in Theorem 5.5 holds only when the data $(\ell, L)$
is generic. However, for the optimization problem in (5.11), the linear space $L = L_F$ and
the linear functional $\ell = \ell_F$ are very specific. They depend on the Lipschitz polytope $P_d$
and the position of the face $F$ relative to the model $\mathcal{M}$.


Proposition 5.21 Consider the optimization problem (5.11) for the independence model
$\mathcal{M} = ((m_1)_{r_1}, \ldots, (m_k)_{r_k})$ with a given face $F$ of the Wasserstein ball $B = P_d^*$. Suppose
that $F$ has codimension $i$. The number of critical points of (5.11) is bounded above by the
polar degree $\delta_i(\mathcal{M})$.


Proof The critical points of (5.11) are given as the solutions of a system of polynomial
equations that depends on parameters $(\mu, \ell, L)$. It follows from Theorem 5.5 that the
number of critical points for general parameters is the polar degree $\delta_i(\mathcal{M})$. The Parameter
Continuation Theorem (Theorem 3.18) implies that the number of solutions can only go
down when we pass from general parameters to special parameters. □


Example 5.22 We investigate the drop in algebraic degree experimentally for the four
independence models in Example 5.16. In the language of algebraic geometry, these models
are the Segre threefold $\mathbb{P}^1 \times \mathbb{P}^1 \times \mathbb{P}^1$ in $\mathbb{P}^7$, the variety $\mathbb{P}^2 \times \mathbb{P}^2$ of rank one $3 \times 3$ matrices
in $\mathbb{P}^8$, the rational normal curve $\mathbb{P}^1$ in $\mathbb{P}^6 = \mathbb{P}(S^6(\mathbb{R}^2))$, and the Segre-Veronese surface
$\mathbb{P}^1 \times \mathbb{P}^1$ in $\mathbb{P}^5 = \mathbb{P}(S^2(\mathbb{R}^2) \otimes \mathbb{R}^2)$. The finite metrics $d$ are specified in the fourth column
of Table 5.1. The fifth column in Table 5.1 records the combinatorial complexity of our
optimization problem. The algebraic complexity is recorded in Table 5.3.

M          | Polar degrees              | Maximal degree           | Average degree
(2, 2, 2)  | (0, 0, 0, 6, 12, 12, 4)    | (0, 0, 0, 4, 12, 6, 0)   | (0, 0, 0, 2.138, 6.382, 3.8, 0)
(3, 3)     | (0, 0, 0, 6, 12, 12, 6, 3) | (0, 0, 0, 2, 8, 6, 6, 0) | (0, 0, 0, 1.093, 3.100, 4.471, 6.0, 0)
(2_6)      | (0, 0, 0, 0, 6, 10)        | (0, 0, 0, 0, 6, 5)       | (0, 0, 0, 0, 6, 5)
(2_2, 2)   | (0, 0, 4, 6, 4)            | (0, 0, 3, 5, 2)          | (0, 0, 2.293, 3.822, 2.0)

Table 5.3: The algebraic degrees of the problem (5.9) for the four models in Example 5.16.


The second column in Table 5.3 gives the vector $(\delta_0, \delta_1, \ldots, \delta_{n-2})$ of polar degrees.
The third and fourth columns are the results of a computational experiment. For each
model, we take 1000 uniform samples $\mu$ with rational coordinates from $\Delta_{n-1}$, and we
solve the optimization problem (5.9). The output is an exact representation of the optimal
solution $\nu^*$. This includes the optimal face $F$ that specifies $\nu^*$, along with its maximal ideal
over $\mathbb{Q}$. The algebraic degree of the optimal solution $\nu^*$ is computed as the number of
complex zeros of that maximal ideal. This number is bounded above by the polar degree
(cf. Proposition 5.21). The fourth column in Table 5.3 shows the average of the algebraic
degrees we found. For example, for the 3-bit model (2, 2, 2) we have $\delta_3 = 6$, corresponding
to $P_d^*$ touching $\mathcal{M}$ at a 3-face $F$. However, the maximum degree we saw in our computations
was 4, with an average degree of 2.138. For 4-faces $F$, we have $\delta_4 = 12$. This degree was
attained in some runs. The average of the degrees for 4-faces was found to be 6.382. ⋄
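
The bound of Proposition 5.21 can be checked entry by entry against the data of Table 5.3. The following is a minimal sketch in Julia; the dictionary layout and the names `table`, `bound_holds` and `attained` are our own choices for illustration and are not part of the original experiment.

```julia
# Rows of Table 5.3: model => (polar degrees, maximal degrees observed).
table = Dict(
    "(2,2,2)" => ([0, 0, 0, 6, 12, 12, 4],    [0, 0, 0, 4, 12, 6, 0]),
    "(3,3)"   => ([0, 0, 0, 6, 12, 12, 6, 3], [0, 0, 0, 2, 8, 6, 6, 0]),
    "(2_6)"   => ([0, 0, 0, 0, 6, 10],        [0, 0, 0, 0, 6, 5]),
    "(2_2,2)" => ([0, 0, 4, 6, 4],            [0, 0, 3, 5, 2]),
)

# Proposition 5.21: each observed degree is at most the polar degree.
bound_holds = Dict(m => all(obs .<= polar) for (m, (polar, obs)) in table)

# Zero-based indices (matching δ_0, δ_1, ...) at which the bound is attained.
attained = Dict(
    m => [i - 1 for i in eachindex(polar) if polar[i] > 0 && obs[i] == polar[i]]
    for (m, (polar, obs)) in table
)
```

For the model (2, 2, 2), the only index where the bound is attained is 4, in line with the remark about 4-faces in Example 5.22.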


In this chapter, we measured the distance to a real algebraic variety with a polyhedral 
norm. We focused on the important case when the norm is a Wasserstein norm and the 
variety is an independence model. We emphasized the distinction between combinatorial 
complexity, given by the f-vector of the unit ball, and algebraic complexity, given by the 
polar degrees of the model. This distinction was made completely explicit in Theorem 5.14. 
Chapter 6
Curvature


The notion of curvature is central to differential geometry and its numerous applications. 
The aim of this chapter is to offer a first introduction to curvature. Our main point is to 
explore how curvature connects to algebraic geometry. We start out with plane curves, 
and we then turn to more general real algebraic varieties. The third section addresses the 
fundamental question of how to compute the volume of a tubular neighborhood of a variety. 


6.1 Plane Curves 


We consider a smooth algebraic curve $C \subset \mathbb{R}^2$. This is given to us in its implicit representation,
as the zero set of an irreducible polynomial of degree $d \geq 1$ in two real variables:

$$f(x_1, x_2) \in \mathbb{R}[x_1, x_2].$$


We are interested in the curvature of the curve $C$ at one of its points $x = (x_1, x_2)$.
Geometrically, the curvature is defined as the rate of change at $x$ of a unit normal vector
traveling along the curve $C$. To be precise, we first define

$$N(x) := \frac{1}{\|\nabla f(x)\|}\,\nabla f(x), \tag{6.1}$$

where the column vector $\nabla f(x) = (\partial f/\partial x_1, \partial f/\partial x_2)^\top$ is the gradient of the polynomial $f$.
For all $x \in C$, the formula for $N(x)$ returns a normal vector of $C$ at $x$. One calls $N(x)$ a
unit normal field. Similarly, a unit tangent field is given by the row vector

$$T(x) := (N(x)_2, -N(x)_1). \tag{6.2}$$


The curvature of $C$ at $x$ is defined as the (signed) magnitude of the derivative of the unit
normal field $N(x)$ in tangent direction. Thus, the curvature is the following scalar quantity:

$$c(x) := \Big\langle T(x),\ T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x) \Big\rangle. \tag{6.3}$$

This measures the rate of change of $N(x)$ as it travels along $C$. Since the derivative of a unit
normal field at a curve always points in tangent direction, the definition of the curvature
$c(x)$ in (6.3) is equivalent to the following identity:

$$T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x) = c(x) \cdot T(x).$$


Fig. 6.1: The left picture shows the unit normal field of an ellipse, and the right picture shows how the 
unit normal field changes when traveling along the ellipse. The lengths of the tangent vectors in the right 
picture indicate the curvature of the ellipse at its various points. 


Example 6.1 Consider the ellipse defined by $f(x) = x_1^2 + 4x_2^2 - 1$. The left picture in Figure
6.1 shows this ellipse in green and its unit normal field $N(x)$ in yellow. The right picture
displays the curvature via $c(x) \cdot T(x)$. The magnitude of a yellow vector attached to a point
$x$ in the right picture gives the curvature at $x$. On the top and bottom, where the ellipse is
rather flat, the normal vectors do not change much, hence the curvature is small. On the
sides, the normal vectors change more rapidly, so there the curvature is larger. ⋄


The inverse of the signed curvature is denoted $r(x) := c(x)^{-1}$. This quantity is called the
(signed) radius of curvature. Indeed, the curve $C$ contains an infinitesimally small arc of a
circle with radius $|r(x)|$ and center $x - r(x) \cdot N(x)$. This is the circle that best approximates
$C$ at $x$. The center of this circle is called a focal point or center of curvature of the curve $C$
at the point $x$. The reason for the negative sign in this formula is that a normal vector
pointing towards the focal point changes towards the direction that is opposite to $T(x)$; see
Figure 6.2 for an illustration. We now connect to our historical discussion.


Proposition 6.2 The Zariski closure of the set of all centers of curvature of a plane curve C 
is the evolute of C, as defined in Section 1.3. 


Proof We consider a local parametrization $\gamma(t)$ of $C$ with $\gamma(0) = x$ and $\dot\gamma(0) = T(x)$.
Let $E$ denote the curve that is traced out by the centers of curvature of $C$. Then, the curve
that is defined by $e(t) := \gamma(t) - r(\gamma(t)) \cdot N(\gamma(t))$ gives a local parametrization of $E$. The
derivative of this parametrization at $t = 0$ equals

$$\dot e(0) = T(x) - \langle \nabla r(x), T(x) \rangle \cdot N(x) - r(x) \cdot \Big( T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x) \Big).$$

Since $T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x) = c(x) \cdot T(x) = r(x)^{-1} \cdot T(x)$, we get

$$\dot e(0) = -\langle \nabla r(x), T(x) \rangle \cdot N(x). \tag{6.4}$$

Hence the tangent line of $E$ at $e(0) = x - r(x) \cdot N(x)$ is the normal line of $C$ at $x$. The curve
$E$ is the envelope of the normal lines. In other words, the curve $E$ is the evolute of $C$. □
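
As a sanity check on the sign conventions, here is a worked instance for the ellipse of Example 6.1, computed directly from (6.1), (6.2) and (6.3). The choice of the point $x = (1,0)$ is ours, made for illustration only.

```latex
% Ellipse f(x) = x_1^2 + 4 x_2^2 - 1, evaluated at the point x = (1,0).
\begin{align*}
\nabla f(x) &= (2x_1,\, 8x_2)^\top, \qquad N(1,0) = (1,0)^\top, \qquad T(1,0) = (0,-1), \\
\frac{\partial N}{\partial x_2}(1,0) &= (0,4)^\top, \qquad
T_1 \cdot \frac{\partial N}{\partial x_1} + T_2 \cdot \frac{\partial N}{\partial x_2}
   = -(0,4)^\top = 4 \cdot T(1,0)^\top, \\
c(1,0) &= 4, \qquad r(1,0) = \tfrac{1}{4}, \qquad
x - r(x) \cdot N(x) = (1,0) - \tfrac{1}{4}(1,0) = \bigl(\tfrac{3}{4},\, 0\bigr).
\end{align*}
```

The center of curvature $(3/4, 0)$ lies inside the ellipse, on the normal line through $(1,0)$, as the proof above predicts.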


Fig. 6.2: The green curve contains an infinitesimally small arc of the dashed circle. The grey center
of the circle is a focal point of the green curve. The red normal vector $N(x)$, pointing towards the
focal point, changes into a normal vector that is slightly tilted in the direction opposite to $T(x)$. Hence
$\dot N(x) := T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x)$ is a negative multiple of $T(x)$.


The previous proof shows that the absolute value of the curvature $|c(x)|$ is the inverse
distance from $x$ to its corresponding point on the evolute. Indeed, the latter point is
$x - r(x) \cdot N(x)$, and so its distance from $x$ is $|r(x)| = |c(x)|^{-1}$. In the remainder of this
section, we will study two types of points: inflection points and points of critical curvature.
These points exhibit special curvature of $C$. Inflection points are points where $C$ is locally
flat, and critical curvature points are points where the curvature has a local extremum.


Definition 6.3 Let $x \in C$. We call $x$ an inflection point if the curvature is zero, i.e. $c(x) = 0$.
We call $x$ a critical curvature point if $x$ is a critical point of the function $C \to \mathbb{R},\ x \mapsto c(x)$.


Example 6.4 We consider the Trott curve $f(x,y) = 144(x^4 + y^4) - 225(x^2 + y^2) + 350x^2y^2 + 81$,
as we did in Figure 2.1. This has degree $d = 4$. Figure 6.3 shows the curve with its
points from Definition 6.3. We first compute inflection points using the numerical software
HomotopyContinuation.jl [31]. This is based on the formulation in Lemma 6.7.


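using HomotopyContinuation, LinearAlgebra

# Homogeneous coordinates; z is the homogenizing variable.
@var x y z; v = [x; y; z];

# Homogenization F of the Trott quartic f.
F = 144*(x^4 + y^4) - 225*(x^2 + y^2)*z^2 + 350*x^2*y^2 + 81*z^4;
dF = differentiate(F, v);     # gradient of F
HO = differentiate(dF, v);    # 3 x 3 Hessian matrix (6.6)

# Dehomogenize by setting z = 1.
f = subs(F, z=>1);
h = subs(-det(HO), z=>1);

# Inflection points are the solutions of f = det(HO) = 0.
inflection_points = solve([f; h])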
By Theorem 6.8, a general curve of degree $d = 4$ has $3d(d-2) = 24$ complex inflection points.
Felix Klein [107] proved that at most $d(d-2) = 8$ can be real. Indeed, for the Trott curve,
we find 8 real inflection points. They are the yellow points in Figure 6.3.
Next, we compute critical curvature points using the equations in Lemma 6.10.

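f1, f2 = dF[1:2]                      # first-order partial derivatives
f11, f21, f12, f22 = HO[1:2,1:2];     # second-order partials (column-major order, f21 = f12)
hx, hy = differentiate(h, [x; y]);
g = f1 * f2 * (f11 - f22) + f12 * (f2^2 - f1^2);
# Equation of Lemma 6.10. Here h = -det(HO), and the overall sign does not change the zero set.
c = subs((f1^2 + f2^2) * (f2 * hx - f1 * hy) - 3 * h * g, z=>1);
crit_curv = solve([f; c])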


By Theorem 6.11, there are 2d(3d — 5) = 56 complex critical curvature points. We find 
that 24 of them are real; out of these, 8 are close (but not equal) to the 8 inflection points, 
which is why they are not visible in the picture. The other 16 critical curvature points are 
shown in red in Figure 6.3. By Proposition 6.5, the critical curvature points correspond to 
cusps on the evolute, which is illustrated in Figure 6.4. ⋄


Fig. 6.3: The Trott curve (2.7) with its inflection points (yellow) and critical curvature points (red).


Proposition 6.5 Let $C \subset \mathbb{R}^2$ be a smooth algebraic curve and $E \subset \mathbb{R}^2$ its evolute. For any
point $x \in C$, let $r(x)$ be the radius of curvature and let $\Gamma(x) := x - r(x) \cdot N(x) \in E$ be the
corresponding point on the evolute. Then


(a) $x$ is an inflection point if and only if $\Gamma(x)$ is a point at infinity, and
(b) $x$ is a point of critical curvature if and only if $E$ has a cusp at $\Gamma(x)$.


Proof Recall that $c(x) = r(x)^{-1}$. The point $\Gamma(x)$ is at infinity if and only if $c(x) = 0$,
which means that $x$ is an inflection point. This proves the first item. For the second item,
we consider a local parametrization $\gamma(t)$ of $C$ with $\gamma(0) = x$ and $\dot\gamma(0) = T(x)$. As in the
proof of Proposition 6.2, the function $e(t) := \Gamma(\gamma(t))$ gives a local parametrization of the
evolute $E$. The evolute has a cusp at $\Gamma(x) = e(0)$ if and only if $\dot e(0) = 0$. By (6.4), the latter
is equivalent to $\langle \nabla r(x), T(x) \rangle = 0$, where $T(x)$ is the unit tangent field (6.2). This identity
holds exactly when the curvature $c(x)$ is critical at $x$ since $\nabla c(x) = -\nabla r(x)/r(x)^2$. □

Fig. 6.4: The picture shows the Trott curve (2.7) in blue and its evolute in red. The cusps on the evolute
correspond to the red critical curvature points in Figure 6.3. The Trott curve has 24 real critical curvature 
points. Out of those, 8 have a radius of curvature that exceeds the boundary of this picture. This is why 
we only see 16 cusps. We thank Emil Horobet and Pierpaola Santarsiero for helping us by computing the 
equation for the evolute. 


Our next goal is to count the number of complex inflection and critical curvature points
for a curve given by a general polynomial $f$ of degree $d$. For this, let us first understand
the curvature $c(x)$ better. In the following, we denote partial derivatives by $f_i := \frac{\partial f}{\partial x_i}$ and
$f_{i,j} := \frac{\partial^2 f}{\partial x_i \partial x_j}$. With this, the Hessian equals

$$H := \begin{pmatrix} f_{1,1} & f_{1,2} \\ f_{1,2} & f_{2,2} \end{pmatrix}.$$

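Lemma 6.6 The curvature of $C$ at $x$ equals

$$c(x) = \frac{1}{\|\nabla f(x)\|} \cdot T(x)\, H(x)\, T(x)^\top.$$

Proof By applying the product rule to (6.1), we obtain

$$T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + T(x)_2 \cdot \frac{\partial N}{\partial x_2}(x) = \frac{1}{\|\nabla f(x)\|}\, H(x)\, T(x)^\top + a(x) \cdot \nabla f(x)$$

for some scalar function $a(x)$. Since $\langle T(x), \nabla f(x) \rangle = 0$, Equation (6.3) gives the assertion. □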
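
Lemma 6.6 is easy to evaluate numerically. The following is a minimal sketch in Julia for the ellipse of Example 6.1; the function names `grad_f`, `hess_f` and `curvature` and the two sample points are our own choices for illustration.

```julia
using LinearAlgebra

# Ellipse from Example 6.1: f(x) = x1^2 + 4*x2^2 - 1.
grad_f(x) = [2 * x[1], 8 * x[2]]      # gradient of f at x
hess_f(x) = [2.0 0.0; 0.0 8.0]        # Hessian H(x), constant for a conic

# Curvature via Lemma 6.6: c(x) = T(x) H(x) T(x)^T / norm(grad f(x)).
function curvature(x)
    g = grad_f(x)
    N = g / norm(g)                   # unit normal (6.1)
    T = [N[2], -N[1]]                 # unit tangent (6.2)
    return dot(T, hess_f(x) * T) / norm(g)
end

c_side = curvature([1.0, 0.0])        # point on the side of the ellipse
c_top  = curvature([0.0, 0.5])        # point on the top of the ellipse
```

With these inputs the side point has curvature 4 and the top point has curvature 1/2, consistent with the qualitative description in Example 6.1.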
We can write the formula from Lemma 6.6 more explicitly as

$$c(x) = \frac{f_{1,1} \cdot f_2^2 - 2 f_{1,2} \cdot f_1 \cdot f_2 + f_{2,2} \cdot f_1^2}{(f_1^2 + f_2^2)^{3/2}}\,(x). \tag{6.5}$$

This formula is appealing to algebraic geometers because it writes curvature in terms of
the polynomial $f(x)$. To make it even more appealing, we now embed $C$ into the complex
projective plane. We write

$$F(x_0, x_1, x_2) := x_0^d \cdot f\Big(\frac{x_1}{x_0}, \frac{x_2}{x_0}\Big)$$


for the homogenization of $f$. The Hessian of the ternary form $F$ is the $3 \times 3$ matrix

$$H_0 := \begin{pmatrix} F_{0,0} & F_{0,1} & F_{0,2} \\ F_{0,1} & F_{1,1} & F_{1,2} \\ F_{0,2} & F_{1,2} & F_{2,2} \end{pmatrix}. \tag{6.6}$$

However, we now view the entries $F_{i,j}$ as inhomogeneous polynomials in the two variables
$x = (x_1, x_2)$ by setting $x_0 = 1$. We can rewrite the curvature of $C$ in terms of $F$. The next
lemma goes back to Salmon [156].


Lemma 6.7 The curvature of the degree $d$ curve $C$ at the point $x$ is equal to

$$c(x) = \frac{-\det H_0}{(d-1)^2 \cdot (f_1^2 + f_2^2)^{3/2}}\,(x).$$


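Proof By homogenizing the polynomials in (6.5), we get $c = P/Q$, where

$$P = F_{1,1} \cdot F_2^2 - 2 F_{1,2} \cdot F_1 \cdot F_2 + F_{2,2} \cdot F_1^2 \quad \text{and} \quad Q = (F_1^2 + F_2^2)^{3/2}.$$

By Euler's formula for homogeneous polynomials, we have

$$(d-1) \cdot F_j = x_0 F_{0,j} + x_1 F_{1,j} + x_2 F_{2,j}, \qquad 0 \leq j \leq 2. \tag{6.7}$$

Substituting this into $P$ gives the following equation:

$$(d-1)^2 \cdot P = (F_{1,1} F_{2,2} - F_{1,2}^2) \bigl( x_1^2 F_{1,1} + x_2^2 F_{2,2} + 2 (x_0 x_1 F_{0,1} + x_0 x_2 F_{0,2} + x_1 x_2 F_{1,2}) \bigr) + x_0^2 \bigl( F_{0,1}^2 F_{2,2} - 2 F_{1,2} F_{0,1} F_{0,2} + F_{1,1} F_{0,2}^2 \bigr).$$

Note that the equation $0 = d \cdot F = x_0 F_0 + x_1 F_1 + x_2 F_2$ holds on the curve $C$. Substituting (6.7)
into this equation, we obtain $x_1^2 F_{1,1} + x_2^2 F_{2,2} + 2 (x_0 x_1 F_{0,1} + x_0 x_2 F_{0,2} + x_1 x_2 F_{1,2}) = -x_0^2 F_{0,0}$.
This identity holds on the curve $C$. From this we conclude

$$P = \frac{x_0^2}{(d-1)^2} \cdot \bigl( -(F_{1,1} F_{2,2} - F_{1,2}^2) F_{0,0} + F_{0,1}^2 F_{2,2} - 2 F_{1,2} F_{0,1} F_{0,2} + F_{1,1} F_{0,2}^2 \bigr).$$

The latter is $-x_0^2 \cdot \det H_0 / (d-1)^2$. Setting $x_0 = 1$ finishes the proof. □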
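
To see the lemma at work, one can evaluate it on the ellipse of Example 6.1, where $d = 2$. The following worked computation is our own illustration; the two evaluation points are chosen for convenience.

```latex
% Ellipse f = x_1^2 + 4 x_2^2 - 1 with homogenization F = x_1^2 + 4 x_2^2 - x_0^2.
\begin{align*}
H_0 &= \begin{pmatrix} -2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 8 \end{pmatrix},
\qquad \det H_0 = -32, \qquad (d-1)^2 = 1, \\
c(x) &= \frac{32}{(4x_1^2 + 64x_2^2)^{3/2}}, \qquad
c(1,0) = \frac{32}{8} = 4, \qquad
c\bigl(0,\tfrac{1}{2}\bigr) = \frac{32}{64} = \frac{1}{2}.
\end{align*}
```

Both values agree with what formula (6.5) gives at the same points.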


We can now count the inflection points of a plane curve. This is due to Felix Klein [107]. 


Theorem 6.8 The number of complex inflection points of a general curve $C$ of degree $d$
is $3d(d-2)$.


Klein also proved that the number of real inflection points is at most $d(d-2)$. So, the
real bound is one third of the complex count. Here, we give a short proof for Theorem 6.8.
Proof By Lemma 6.7, the inflection points on $C$ are defined by the equations $f = \det H_0 = 0$.
This is a system of two polynomial equations in two variables $x = (x_1, x_2)$. The degree
of $f$ is $d$ and the degree of $\det H_0$ is $3(d-2)$. Bézout's theorem implies that the number of
inflection points is at most $3d(d-2)$. To show that the number is also at least $3d(d-2)$,
we use the Parameter Continuation Theorem (Theorem 3.18), and we show that there exist
degree $d$ curves with this number of inflection points.


Consider a univariate polynomial $g(x_1) \in \mathbb{R}[x_1]$ of degree $d$ and let $G(x_0, x_1)$ be its
homogenization. We write $M := \begin{pmatrix} G_{0,0} & G_{0,1} \\ G_{0,1} & G_{1,1} \end{pmatrix}$ for the Hessian of the binary form $G(x_0, x_1)$.
We assume (1) that $g$ has $d$ regular zeros, (2) that $\det M = 0$ has only regular zeros, and (3)
that $G = \det M = 0$ has no solutions in $\mathbb{P}^1$. All three are Zariski open conditions, so almost
all polynomials $g$ satisfy this assumption. Define

$$f(x_1, x_2) := x_2^d - g(x_1).$$

The Hessian (6.6) of the plane curve $f$ satisfies $\det H_0 = d(d-1) \cdot x_2^{d-2} \cdot \det M$. Consequently,
$\det H_0 = 0$ if and only if either $x_2 = 0$, or $x_1$ is among the $2(d-2)$ zeros of $\det M$. This
means that $x$ is an inflection point if either $x = (x_1, 0)$ and $x_1$ is a zero of $g$, or $x = (x_1, x_2)$
where $x_1$ is a zero of $\det M$ and $x_2^d = g(x_1)$.

In the first case, we find $d$ inflection points, and each has multiplicity $d-2$. In the second
case, since $g(x_1) \neq 0$, we find $2d(d-2)$ many regular inflection points with multiplicity
one. Now, if we perturb $f$ slightly, then the $d$ points with multiplicity $d-2$ will split into
$d(d-2)$ inflection points, while the other $2d(d-2)$ inflection points will remain distinct.
In total, this gives $3d(d-2)$ inflection points. □


Corollary 6.9 For a general plane curve $C$ of degree $d$, the evolute has degree $3d(d-1)$.


Proof We compute the degree by intersecting the evolute with the line at infinity. For that,
we consider the Zariski closure $\bar{C}$ of the curve $C$ in the complex projective plane $\mathbb{P}^2_{\mathbb{C}}$. By
Proposition 6.2, the evolute is the image of $\Gamma : \bar{C} \to \mathbb{P}^2_{\mathbb{C}},\ x \mapsto x - r(x) \cdot N(x)$. A point $\Gamma(x)$
on the evolute can be at infinity for two reasons: Either $x \in \bar{C}$ is at infinity, or $x$ is a finite
point and a complex inflection point of the curve $C$ by Proposition 6.5. Since $C$ is a general
curve of degree $d$, there are $d$ points $x$ of the first kind and $3d(d-2)$ points $x$ of the second
kind, by Theorem 6.8. For each of the $d$ points $x \in \bar{C}$ at infinity, Salmon [156, §119] shows
that $\Gamma(x)$ is a cusp whose tangent line is the line at infinity. Hence, when intersecting the
evolute with the line at infinity, there are $d$ cusps (that count with multiplicity three each)
plus $3d(d-2)$ points that correspond to the complex inflection points of $C$. All in all, the
degree of the evolute is $3d + 3d(d-2) = 3d(d-1)$. □


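Let us now find polynomial equations for critical curvature.

Lemma 6.10 The points of critical curvature on the curve $C = \{ f(x) = 0 \}$ are defined by

$$(f_1^2 + f_2^2) \cdot \Big( f_2 \cdot \frac{\partial \det H_0}{\partial x_1} - f_1 \cdot \frac{\partial \det H_0}{\partial x_2} \Big) - 3 \det H_0 \cdot g = 0,$$

where $g := f_1 f_2 \cdot (f_{1,1} - f_{2,2}) + f_{1,2} \cdot (f_2^2 - f_1^2)$.

Proof Critical curvature points on $C$ are defined by the equations

$$f(x) = f_2(x) \cdot \frac{\partial c}{\partial x_1}(x) - f_1(x) \cdot \frac{\partial c}{\partial x_2}(x) = 0.$$

By Lemma 6.7 and the product rule, we have

$$-(d-1)^2 \cdot (f_1^2 + f_2^2)^{5/2} \cdot \frac{\partial c}{\partial x_i} = (f_1^2 + f_2^2) \cdot \frac{\partial \det H_0}{\partial x_i} - 3 \det H_0 \cdot (f_1 f_{1,i} + f_2 f_{2,i}).$$

This implies the polynomial equation stated above. □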


Theorem 6.11 A general plane curve $C$ of degree $d$ has $2d(3d-5)$ critical curvature
points over the complex numbers $\mathbb{C}$.


Proof Recall that critical curvature points correspond to finite cusps of the evolute by
Proposition 6.5. Piene, Riener, and Shapiro prove in [146, Proposition 3.3] that, counting
in the complex projective plane, the number of cusps on the evolute for a general plane
curve $C$ of degree $d$ is $6d^2 - 9d$. As explained in the proof of Corollary 6.9 above,
Salmon [156, §119] shows that $d$ of these cusps lie at infinity. Therefore, the curve $C$ has
$6d^2 - 9d - d = 2d(3d-5)$ complex critical curvature points. □
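
The counts of this section are all explicit polynomials in the degree, so they are easy to tabulate. The following is a minimal sketch in Julia; the function names are hypothetical and chosen only for this illustration.

```julia
# Counts for a general plane curve of degree d.
inflection_complex(d) = 3 * d * (d - 2)           # Theorem 6.8
inflection_real_max(d) = d * (d - 2)              # Klein's bound on real inflection points
evolute_degree(d) = 3 * d * (d - 1)               # Corollary 6.9
evolute_cusps(d) = 6 * d^2 - 9 * d                # cusps of the evolute, including those at infinity
critical_curvature_complex(d) = 2 * d * (3 * d - 5)   # Theorem 6.11

# Tabulate the counts for small degrees.
for d in 2:6
    println((d, inflection_complex(d), inflection_real_max(d),
             evolute_degree(d), critical_curvature_complex(d)))
end
```

For the Trott curve, `d = 4` reproduces the numbers 24, 8 and 56 quoted in Example 6.4.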


6.2 Algebraic Varieties 


We now turn to smooth algebraic varieties in $\mathbb{R}^n$ of any dimension. Our aim is to study
their curvature. This will lead us to the notions of the second fundamental form and the
Weingarten map. These are fundamental concepts in Riemannian geometry. In standard
textbooks, these concepts are presented in much more general contexts; see, for instance,
[58, 120, 140]. Our main goal in this section is to formulate the second fundamental form
and Weingarten map in terms of the polynomial equations that define the variety.

Let $X \subset \mathbb{R}^n$ be a smooth algebraic variety of dimension $m$. As in the case of plane
curves, we consider a unit normal field $N(x)$ for $X$ and we differentiate it along a tangent
field $T(x)$. The main difference to the case of plane curves is that there are usually many
tangent directions and many normal directions. In fact, every choice of normal and tangent
direction defines a curvature. Similar to (6.3), we define the curvature of $X$ at a point $x$ in
tangent direction $T(x)$ and in normal direction $N(x)$ to be
$\big\langle T(x),\ T(x)_1 \cdot \frac{\partial N}{\partial x_1}(x) + \cdots + T(x)_n \cdot \frac{\partial N}{\partial x_n}(x) \big\rangle$.
We will see that, as for plane curves, this only depends on the values of
$T(x)$ and $N(x)$ at a fixed point $x$, but not on how those fields behave locally around $x$.

Let us work this out. We assume that the Zariski closure of $X$ is irreducible. Its ideal

$$I(X) = \langle f_1, \ldots, f_k \rangle$$

is prime. We denote the gradients of the polynomials $f_i$ by $\nabla f_i$ and their Hessians by $H_i$
for $i = 1, \ldots, k$. The (transpose of the) Jacobian of $f_1, \ldots, f_k$ at $x$ is denoted by

$$J(x) := \begin{bmatrix} \nabla f_1(x) & \cdots & \nabla f_k(x) \end{bmatrix} \in \mathbb{R}^{n \times k}.$$

A smooth unit normal field on $X$ is given by

$$N(x) := \frac{J(x)\, w(x)}{\|J(x)\, w(x)\|} = \frac{1}{\|J(x)\, w(x)\|} \sum_{i=1}^k w_i(x)\, \nabla f_i(x), \tag{6.8}$$

where $w : X \to \mathbb{R}^k$ is a smooth function with $w(x) \notin \ker J(x)$ for all $x$. Differentiating
(6.8) leads to the following equation, for some matrix-valued function $R(x)$:

$$\frac{\mathrm{d}N}{\mathrm{d}x}(x) = \frac{1}{\|J(x)\, w(x)\|} \sum_{i=1}^k w_i(x)\, H_i(x) + J(x)\, R(x).$$


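Let us now fix a point $x \in X$. We denote the tangent vector by $t := T(x) \in T_xX$ and the
normal vector by $v := J(x)\, w(x) \in N_xX$. Since $t^\top J(x) = 0$, this implies

$$\Big\langle t,\ t_1 \cdot \frac{\partial N}{\partial x_1}(x) + \cdots + t_n \cdot \frac{\partial N}{\partial x_n}(x) \Big\rangle = \frac{1}{\|v\|}\, t^\top \Big( \sum_{i=1}^k w_i(x)\, H_i(x) \Big)\, t.$$

The right-hand side only depends on the values of $T$ and $N$ at $x$.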


Definition 6.12 The curvature of a real algebraic variety $X$ at a smooth point $x$ in tangent
direction $t \in T_xX$ and in normal direction $v \in N_xX$ is the scalar

$$c(x, t, v) := \frac{1}{\|v\|}\, t^\top \Big( \sum_{i=1}^k w_i \cdot H_i \Big)\, t,$$

where $v = \sum_{i=1}^k w_i \nabla f_i \in N_xX$. In this formula, we use the notation introduced above.


Remark 6.13 If the codimension of $X$ is less than $k$, then the $w_i$ are not unique. Still, the
formula for $c(x, t, v)$ is well-defined. Indeed, if $\sum_{i=1}^k w_i \cdot \nabla f_i = 0$, then $t^\top \big( \sum_{i=1}^k w_i \cdot H_i \big)\, t = 0$
for all $t \in T_xX$.


In fact, for $x \in X$ and a fixed normal vector $v \in N_xX$, the curvature $c(x, t, v)$ is a
quadratic form on $T_xX$. This quadratic form is called the second fundamental form of $X$ at
$x$ and $v$. It is often denoted by

$$\mathrm{II}_v(t) := c(x, t, v). \tag{6.9}$$

The linear map associated with this quadratic form is the Weingarten map. We denote it by

$$L_v : T_xX \to T_xX, \quad t \mapsto P_x \Big( \sum_{i=1}^k w_i \cdot H_i \cdot t \Big), \quad \text{where } v = \sum_{i=1}^k w_i \nabla f_i, \tag{6.10}$$

and $P_x : \mathbb{R}^n \to T_xX$ is the orthogonal projection onto the tangent space of $X$ at $x$. Since
$L_v$ is a self-adjoint operator given by a real symmetric matrix, its eigenvalues are all real.
If $\|v\| = 1$, the eigenvalues of $L_v$ are called the principal curvatures of $X$ at $x$ and in
normal direction $v$. The product of the principal curvatures is called the Gauss curvature;
the arithmetic mean of the principal curvatures is called the mean curvature.

Since the principal curvatures are the critical points of the quadratic form $\mathrm{II}_v(t)$, the
maximal curvature

$$C(X) := \max_{x \in X,\ t \in T_xX,\ v \in N_xX} c(x, t, v) = \max_{x \in X,\ v \in N_xX}\ \max_{t \in T_xX} c(x, t, v) \tag{6.11}$$

is the maximum over all principal curvatures for varying $(x, v)$.


Example 6.14 (Hypersurfaces) If $X$ is defined by one polynomial $f(x)$, then we have only
one normal direction (up to sign). Here, the formula in Definition 6.12 can be written as

$$c(x, t) := \frac{1}{\|\nabla f(x)\|}\, t^\top H\, t,$$

where $H$ is the Hessian of $f$. This generalizes the formula in Lemma 6.6. ⋄
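
For a hypersurface, the principal curvatures can be computed numerically by restricting the Hessian to the tangent space. The following is a minimal sketch in Julia for a quadric $\{a_1 x_1^2 + a_2 x_2^2 + a_3 x_3^2 = 1\}$, using the unit normal $v = \nabla f / \|\nabla f\|$. The function name `principal_curvatures` and the sample coefficients are our own assumptions for illustration.

```julia
using LinearAlgebra

# Principal curvatures of the quadric {a1*x1^2 + a2*x2^2 + a3*x3^2 = 1}
# at a point x on the surface, for the unit normal v = grad f / norm(grad f).
function principal_curvatures(a::AbstractVector, x::AbstractVector)
    g = float.(2 .* a .* x)             # gradient of f at x
    H = Diagonal(float.(2 .* a))        # Hessian of f
    B = nullspace(reshape(g, 1, :))     # columns: orthonormal basis of the tangent plane
    W = (B' * H * B) ./ norm(g)         # Weingarten map (6.10) in that basis
    return eigvals(Symmetric(W))        # real eigenvalues, sorted ascending
end

# Ellipsoid with semi-axes 1, 1/2, 1/3, evaluated at the point (1, 0, 0).
k = principal_curvatures([1.0, 4.0, 9.0], [1.0, 0.0, 0.0])
gauss_curvature = prod(k)
mean_curvature = sum(k) / length(k)
```

At the chosen point the two principal curvatures are 4 and 9. They are distinct, so this point is not an umbilic in the sense discussed next.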


Let us now focus on surfaces in $\mathbb{R}^3$. Let $S \subset \mathbb{R}^3$ be a smooth algebraic surface and $x \in S$.
When the two principal curvatures of $S$ at $x$ are equal, the point $x$ is called an umbilic or
umbilical point of the surface $S$. Equivalently, the best second-order approximation of $S$ at
$x$ is a 2-sphere. Umbilical points can be formulated as the zeros of a system of polynomial
equations, whose complex zeros are called complex umbilics of the surface $S$. Salmon [157]
computed the number of complex umbilics of a general surface.


Theorem 6.15 A general surface of degree $d$ in $\mathbb{R}^3$ has $10d^3 - 28d^2 + 22d$ complex umbilics.


In the case of surfaces of degree $d = 2$, we have results on the number of real umbilics
and critical curvature points. Observe that rotations and translations do not affect the
curvature, and that after a rotation and translation every quadric surface in $\mathbb{R}^3$ has the form

$$S = \{ a_1 x_1^2 + a_2 x_2^2 + a_3 x_3^2 = 1 \}.$$

By Theorem 6.15, the surface $S$ has 12 complex umbilics. The next theorem is proved
in [29]. We shall assume that $a_1 a_2 a_3 (a_1 - a_2)(a_1 - a_3)(a_2 - a_3) \neq 0$.


Theorem 6.16 The number of real umbilics of the quadratic surface $S$ equals


• 4 if $S$ is an ellipsoid ($a_1, a_2, a_3$ are positive) or a two-sheeted hyperboloid (one of the
  $a_i$ is positive and two are negative);
• 0 if $S$ is a one-sheeted hyperboloid (two of the $a_i$ are positive and one is negative).


Similar to the case of plane curves, we call a point x on a surface S a critical curvature 
point if one of the two principal curvatures of S attains a critical value at x. The first 
observation is that umbilics are always critical curvature points. This was shown in [29]. 
The following result from [29] covers the case of quadrics. 


Theorem 6.17 A general quadric surface $S \subset \mathbb{R}^3$ has 18 complex critical curvature points.
The number of real critical curvature points equals


• 10 if $S$ is an ellipsoid ($a_1, a_2, a_3$ are positive);
• 4 if $S$ is a one-sheeted hyperboloid (two of the $a_i$ are positive and one is negative);
• 6 if $S$ is a two-sheeted hyperboloid (one of the $a_i$ is positive and two are negative).

Fig. 6.5: The pictures illustrate Theorems 6.16 and 6.17. The figure on the left shows an ellipsoid with 4
red umbilics and 6 green critical curvature points. The umbilics are also critical curvature points, so there 
are 10 in total. Similarly, the figure in the middle shows a one-sheeted hyperboloid with 4 green critical 
curvature points, and the figure on the right shows a two-sheeted hyperboloid with 4 red umbilics and 2 
green critical curvature points (so 6 critical curvature points in total). 


6.3 Volumes of Tubular Neighborhoods 


In this section, we study the volume of a tubular neighborhood of a real algebraic variety. 
This is closely connected to curvature, as we will see. Methods for computing volumes of 
semialgebraic sets numerically will be presented in Chapter 14. 

The tubular neighborhood of radius $\varepsilon$ of a variety $X \subset \mathbb{R}^n$ is the set

$$\mathrm{Tube}(X, \varepsilon) := \{ u \in \mathbb{R}^n \mid d(u, X) < \varepsilon \},$$

where $d(u, X) = \min_{x \in X} \|u - x\|$ is the Euclidean distance from $u$ to $X$. There are several
general formulas for upper bounds on the volume of $\mathrm{Tube}(X, \varepsilon)$ in the literature. For
instance, Lotz [125] studied the case of a general complete intersection. Bürgisser, Cucker,
and Lotz studied the case of a (possibly) singular hypersurface [36] in the sphere.

We now state the most general formula, due to Basu and Lerario [14]. Their theorem
also holds for singular varieties. The proof of the theorem is based on approximating $X$ in
the Hausdorff topology by a sequence of smooth varieties $(X_k)_{k \in \mathbb{N}}$ and showing that the
volume of the tubular neighborhood of $X_k$ can be controlled as $k \to \infty$.


Theorem 6.18 Let $X \subset \mathbb{R}^n$ be a real variety of dimension $m$, defined by polynomials of
degree $\leq d$. Fix $u \in \mathbb{R}^n$ and let $B_r(u)$ denote the ball of radius $r > 0$ around $u$. For
every $0 < \varepsilon < r/(4dm + m)$, we have


vol(Tube(X, £) N B,(u)) 2e (2s) 


vol(B, (u)) r 


The volume of tubular neighborhoods of smooth varieties (in fact, of smooth submanifolds
of $\mathbb{R}^n$) is given by Weyl's tube formula. We shall now derive this formula. For a more
detailed derivation and discussion, we refer to Weyl's original paper [175].

Let $X \subset \mathbb{R}^n$ be smooth and $NX$ be the normal bundle of $X$. The $\varepsilon$-normal bundle is

$$N_\varepsilon X := \{ (x, v) \in NX \mid \|v\| < \varepsilon \}.$$

The (normal) exponential map is the following parametrization of the tubular neighborhood:

$$\varphi_\varepsilon : N_\varepsilon X \to \mathrm{Tube}(X, \varepsilon), \quad (x, v) \mapsto x + v. \tag{6.12}$$

Definition 6.19 The reach of $X$ is defined as

$$\tau(X) := \sup \{ \varepsilon > 0 \mid \varphi_\varepsilon \text{ is a diffeomorphism} \}. \tag{6.13}$$


If $X$ is smooth and compact, then the set in (6.13) is non-empty, and hence the reach
$\tau(X)$ is a positive real number; see, e.g., [121, Theorem 6.24]. See Chapters 7 and 15 for
further properties of the reach. In what follows, we assume that $X$ is compact. If not, then
we replace $X$ by the semialgebraic set $X \cap B$, where $B$ is a ball.

The desired volume is the integral of the constant function 1 over the tube. If $\varepsilon < \tau(X)$,
then the exponential map $\varphi_\varepsilon$ is a diffeomorphism and we can pull that integral back to the
normal bundle $NX$. To be precise, let $A$ be the matrix that represents the derivative of $\varphi_\varepsilon$
with respect to orthonormal bases. We have

$$\mathrm{vol}(\mathrm{Tube}(X, \varepsilon)) = \int_{\mathrm{Tube}(X, \varepsilon)} \mathrm{d}u = \int_{x \in X} \int_{v \in N_xX :\, \|v\| < \varepsilon} |\det(A(x, v))| \, \mathrm{d}v \, \mathrm{d}x. \tag{6.14}$$

We compute the matrix $A := A(x, v)$. It represents a linear map

$$T_{(x,v)} NX \cong T_xX \oplus N_xX \ \to\ \mathbb{R}^n \cong T_xX \oplus N_xX.$$


Let $B_1$ be an orthonormal basis for the tangent space $T_xX$ and $B_2$ one for the normal
space $N_xX$. An orthonormal basis for $T_xX \oplus N_xX$ is $\{ (t, 0) \mid t \in B_1 \} \cup \{ (0, z) \mid z \in B_2 \}$.
We compute the image of these basis vectors under $A$. First, let $t \in B_1$. If $v = \lambda \cdot w$, $\lambda = \|v\|$,
and $L_w$ denotes the Weingarten map from (6.10), then we have

$$A\,(t, 0)^\top = t + (\lambda \cdot L_w)\, t.$$

This is because $(t, 0)$ is the tangent vector of a curve $(x(s), v(s)) \in NX$ passing through
$(x(0), v(0)) = (x, v)$, such that $\|v(s)\|$ is constant and equal to $\lambda$. If the derivative of $x(s)$
at $s = 0$ is $t$, then the derivative of $v(s)$ at $s = 0$ is $(\lambda \cdot L_w)\, t$, since the image of $t$ under
the Weingarten map $L_w$ is the derivative of the unit normal vector $\lambda^{-1} \cdot v(s)$. Linearity of
differentiation implies that $A\,(t, 0)^\top$ is the sum of these two terms.

Next, we consider a basis vector $(0, z)$ for $z \in B_2$. It is the tangent vector of a curve
$(x(s), v(s))$ in $NX$, where $x(s)$ is constant and $v(s)$ is a curve in the normal space $N_xX$,
whose derivative at $s = 0$ is $z$. This shows the equation

$$A\,(0, z)^\top = z.$$

Therefore, a matrix representation with respect to orthonormal bases is

$$A(x, v) = \begin{pmatrix} I_m + \lambda \cdot L_w & 0 \\ 0 & I_{n-m} \end{pmatrix}, \qquad m = \dim X. \tag{6.15}$$


Since $\lambda = \|v\| < \tau(X)$, the eigenvalues of $\lambda \cdot L_w$ have absolute value at most 1. This implies
that the determinant of $A(x, v)$ is positive, and therefore

$$|\det(A(x, v))| = \det(I_m + \lambda \cdot L_w).$$

The transformation $v \mapsto (w, \lambda)$ has Jacobian determinant $\lambda^{n-m-1}$. Plugging all this into
the inner integral in (6.14), we arrive at the following integral for the volume of the tube:

$$\mathrm{vol}(\mathrm{Tube}(X, \varepsilon)) = \int_{x \in X} \int_{\lambda = 0}^{\varepsilon} \int_{w \in N_xX :\, \|w\| = 1} \lambda^{n-m-1} \cdot \det(I_m + \lambda \cdot L_w) \, \mathrm{d}w \, \mathrm{d}\lambda \, \mathrm{d}x.$$

Expanding the characteristic polynomial inside this integral, we see that $\mathrm{vol}(\mathrm{Tube}(X, \varepsilon))$
is a polynomial in $\varepsilon$ of degree $n$ whose coefficients are integrals of the principal minors of
the Weingarten map $L_w$ over $X$. Since $L_{-w} = -L_w$, the integrals over the odd-dimensional
minors of $L_w$ vanish. All this leads to:


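Theorem 6.20 (Weyl's tube formula [175]) In the notation above, we have

$$\mathrm{vol}(\mathrm{Tube}(X, \varepsilon)) = \sum_{0 \leq 2i \leq m} \kappa_{2i}(X) \cdot \varepsilon^{n-m+2i}.$$

The coefficients $\kappa_{2i}(X)$ of this polynomial are called curvature coefficients of $X$. Explicitly,

$$\kappa_{2i}(X) = \frac{1}{n-m+2i} \int_{x \in X} \int_{w \in N_xX :\, \|w\| = 1} m_{2i}(L_w) \, \mathrm{d}w \, \mathrm{d}x,$$

where $m_{2i}(\cdot)$ denotes the sum of the principal minors of format $2i \times 2i$.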


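In fact, the curvature coefficient $\kappa_0(X)$ is always equal to the $m$-dimensional volume
of $X$ times the volume of the unit ball $B^{n-m} = \{ x \in \mathbb{R}^{n-m} \mid \|x\| \leq 1 \}$. This yields:


Corollary 6.21 The volume of a compact variety $X$ of dimension $m$ in $\mathbb{R}^n$ is equal to

$$\mathrm{vol}(X) = \lim_{\varepsilon \to 0} \frac{\mathrm{vol}(\mathrm{Tube}(X, \varepsilon))}{\varepsilon^{n-m} \cdot \mathrm{vol}(B^{n-m})}.$$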

We close this chapter by discussing Weyl's tube formula in two low-dimensional cases.

Example 6.22 (Curves) If $X \subset \mathbb{R}^n$ is a smooth algebraic curve, then

$$\mathrm{vol}(\mathrm{Tube}(X, \varepsilon)) = \varepsilon^{n-1} \cdot \mathrm{vol}(B^{n-1}) \cdot \mathrm{length}(X).$$

For instance, the volume of the $\varepsilon$-tube around a plane curve $C$ is $2\varepsilon \cdot \mathrm{length}(C)$. ⋄
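
A quick consistency check of the plane-curve case, under the assumption that $C$ is a circle of radius $R$ (our choice of example) and $\varepsilon < R$, so that $\varepsilon$ is below the reach. The tube is then an annulus whose area can be computed directly:

```latex
% Tube around a circle C of radius R in the plane, for 0 < \varepsilon < R.
\begin{align*}
\mathrm{vol}(\mathrm{Tube}(C, \varepsilon))
  &= \pi (R + \varepsilon)^2 - \pi (R - \varepsilon)^2
   = 4 \pi R \varepsilon
   = 2 \varepsilon \cdot 2 \pi R
   = 2 \varepsilon \cdot \mathrm{length}(C).
\end{align*}
```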
Example 6.23 (Surfaces in 3-space) Let $S = \{ f(x) = 0 \} \subset \mathbb{R}^3$ be a compact smooth
algebraic surface with Euler characteristic $\chi(S)$. For $\varepsilon < \tau(S)$, the volume of its $\varepsilon$-tube is

$$2\varepsilon \cdot \mathrm{area}(S) + \varepsilon^3 \cdot \mathrm{vol}(B^3) \cdot \chi(S).$$

Let us prove this. Weyl's tube formula implies

$$\mathrm{vol}(\mathrm{Tube}(S, \varepsilon)) = 2\varepsilon \cdot \mathrm{area}(S) + \varepsilon^3 \cdot \kappa_2(S),$$

where the coefficient $\kappa_2(S)$ is the integral of the Gauss curvature:

$$\kappa_2(S) = \frac{2}{3} \int_{x \in S} \det(L_{N(x)}) \, \mathrm{d}x,$$

where $N(x) = \|\nabla f(x)\|^{-1} \cdot \nabla f(x)$ denotes the normal field of $S$ at $x$ given by the gradient
of $f$. The surface $S$ is orientable since it lives in $\mathbb{R}^3$. Indeed, the orientation is given
by the normal field $N(x)$. The Gauss-Bonnet theorem (see, e.g., [120, Theorem 9.3])
implies that the integral over the Gauss curvature is $2\pi \cdot \chi(S)$. Moreover, the volume of the
three-dimensional unit ball is $\mathrm{vol}(B^3) = 4\pi/3$. ⋄
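
The formula of Example 6.23 can be compared with a case where the tube volume is known in closed form. The following is a minimal sketch in Julia, assuming $S$ is a sphere of radius $R$ (so $\chi(S) = 2$, $\mathrm{area}(S) = 4\pi R^2$, and the reach is $R$); the function names and the sample values are hypothetical.

```julia
# Exact volume of the ε-tube around a sphere of radius R: a spherical shell, valid for ε < R.
shell_volume(R, ε) = 4π / 3 * ((R + ε)^3 - (R - ε)^3)

# Formula of Example 6.23 with area(S) = 4πR^2, vol(B^3) = 4π/3 and χ(S) = 2.
weyl_volume(R, ε) = 2 * ε * (4π * R^2) + ε^3 * (4π / 3) * 2

R, ε = 2.0, 0.5
exact = shell_volume(R, ε)
predicted = weyl_volume(R, ε)
```

Both expressions expand to $8\pi R^2 \varepsilon + \tfrac{8\pi}{3}\varepsilon^3$, so the two values agree up to floating-point rounding.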
Chapter 7
Reach and Offset


In this chapter, we study the medial axis, bottlenecks, and offset hypersurfaces. These 
notions are intuitive and important for many applications. They will also lead to a better 
understanding of the geometry of the reach, which was introduced in Definition 6.19. The 
last section is devoted to the offset discriminant of a variety. This offers a direct link between 
the ED problem in Chapter 2 and differential geometric concepts we saw in the previous 
chapter, namely the second fundamental form (6.9) and the Weingarten map (6.10). 


7.1 Medial Axis and Bottlenecks 


The medial axis $\mathrm{Med}(X)$ of a set $X \subset \mathbb{R}^n$ is the set of points $u \in \mathbb{R}^n$ such that there exist
at least two distinct points on $X$ at which the distance from $X$ to $u$ is attained. In other
words, the medial axis is the locus of points in $\mathbb{R}^n$ where the Euclidean projection to $X$ is
not well-defined. We shall later focus on the case when $X$ is a smooth variety in $\mathbb{R}^n$.


Proposition 7.1 If $X$ is a semialgebraic set in $\mathbb{R}^n$ then its medial axis $\mathrm{Med}(X)$ is also a
semialgebraic set.


Proof The sentence in the introductory paragraph that defines the medial axis can be
expressed using polynomial inequalities, together with the existential quantifier in “there
exist at least two points”. By Tarski’s Theorem on Quantifier Elimination, there exists a
quantifier-free formula for $\mathrm{Med}(X)$. The medial axis is, therefore, semialgebraic. $\square$


The reach $\tau(X)$ was defined as the supremum over all $\varepsilon$ such that the exponential map
of $X$ restricted to normal vectors of length $< \varepsilon$ is a diffeomorphism. This implies that all
points $u \in \mathbb{R}^n$ whose distance to $X$ is less than $\tau(X)$ have a unique closest point on $X$.
Consequently, the distance between $X$ and its medial axis must be at least $\tau(X)$. This shows
that the reach is intimately linked with the medial axis. We make this connection more
precise in the following result. Proposition 7.2 can be used as a definition for the reach of
singular varieties. Yet another characterization of the reach will be given in Theorem 7.8.

Proposition 7.2 The distance from a smooth real variety $X$ to its medial axis $\mathrm{Med}(X)$ is
the reach $\tau(X)$.


Proof The exponential map $\varphi_\varepsilon : N_\varepsilon X \to \mathrm{Tube}(X,\varepsilon)$ was defined in (6.12) by the
formula $(x,v) \mapsto x + v$. Suppose $\varepsilon < \tau(X)$. Then $\varphi_\varepsilon$ is a diffeomorphism, and the tubular
neighborhood $\mathrm{Tube}(X,\varepsilon)$ is disjoint from the medial axis $\mathrm{Med}(X)$ since otherwise $\varphi_\varepsilon$
cannot be injective. Hence, the distance from $X$ to $\mathrm{Med}(X)$ is at least $\tau(X)$. To show the
reverse inequality, let $\varepsilon > 0$ be less than the distance from $X$ to $\mathrm{Med}(X)$. Fix a point $u$
in $\mathrm{Tube}(X,\varepsilon)$. The Euclidean closure of $\mathrm{Med}(X)$ does not contain $u$. Thus, in an open
neighborhood $U \subset \mathrm{Tube}(X,\varepsilon)$ of $u$, the points have a unique closest point on $X$. This
defines a smooth map $U \to X,\ u \mapsto x(u)$. The map $\psi_U : U \to N_\varepsilon X,\ u \mapsto (x(u), u - x(u))$
is then smooth with smooth inverse $\psi_U^{-1} = \varphi_\varepsilon|_{\psi_U(U)}$. Via a partition of unity of $\mathrm{Tube}(X,\varepsilon)$
(see e.g. [121, Chapter 2]), we can obtain a global smooth inverse of $\varphi_\varepsilon$ from the local
inverses $\psi_U$. Hence, $\varphi_\varepsilon$ is a diffeomorphism and $\varepsilon \leq \tau(X)$. $\square$


Example 7.3 The reach of the parabola $X$ in Example 7.4 is $\tau(X) = \frac{1}{2}$. It is realized as
the distance between $(0,0) \in X$ and $(0, \frac{1}{2})$, which is a point in the Euclidean closure
of $\mathrm{Med}(X) = \{(0,u_2) \in \mathbb{R}^2 \mid u_2 > \frac{1}{2}\}$. $\diamond$


We define the algebraic medial axis as the Zariski closure of the medial axis. In symbols,

$$M_X := \overline{\mathrm{Med}(X)}.$$

The variety $M_X$ is our algebraic proxy for the medial axis $\mathrm{Med}(X)$. By Proposition 7.1, the
variety $M_X$ and the semialgebraic set $\mathrm{Med}(X)$ have the same dimension.


Example 7.4 Consider the parabola $X = V(x_2 - x_1^2)$. We compute the algebraic medial axis
of $X$. The result is shown in Figure 7.4 below. If $x = (x_1,x_2) \in X$ minimizes the distance
to a point $u = (u_1,u_2) \in \mathbb{R}^2$, then we have $\langle x - u, t\rangle = 0$, where $t$ spans the tangent space
$T_xX$. We use Macaulay2 [73] to compute $M_X$ algebraically:

R = QQ[x1, x2, y1, y2, u1, u2];
-- x and y are two points on the parabola
fx = x2 - x1^2; fy = y2 - y1^2;
-- x - u and y - u are normal to the parabola at x and at y
Jx = matrix {{x1-u1, x2-u2}, {diff(x1, fx), diff(x2, fx)}};
Jy = matrix {{y1-u1, y2-u2}, {diff(y1, fy), diff(y2, fy)}};
-- x and y have the same squared distance to u
distxu = (x1 - u1)^2 + (x2 - u2)^2;
distyu = (y1 - u1)^2 + (y2 - u2)^2;
I = ideal {fx, fy, det(Jx), det(Jy), distxu - distyu};
-- remove the diagonal x = y, then eliminate x and y
K = saturate(I, ideal {x1-y1, x2-y2});
eliminate({x1, x2, y1, y2}, K)

This returns the ideal $\langle u_1 \rangle$, so the algebraic medial axis is $M_X = \{u_1 = 0\}$. $\diamond$
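The symbolic result can be cross-checked numerically. A point $(t,t^2)$ of the parabola is ED critical for $u$ exactly when $2t^3 + (1-2u_2)t - u_1 = 0$, so one can solve this cubic and count how many critical points realize the minimal distance. The sketch below uses only closed-form cubic formulas; the function names and the tolerance `tol` are assumptions made for this illustration.

```python
import math

def _cbrt(x: float) -> float:
    """Real cube root."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def ed_critical_parameters(u1: float, u2: float) -> list:
    """Real roots t of 2t^3 + (1 - 2*u2)*t - u1 = 0."""
    p, q = (1 - 2 * u2) / 2, -u1 / 2          # depressed cubic t^3 + p*t + q
    disc = -4 * p ** 3 - 27 * q ** 2
    if disc > 0:                               # three distinct real roots
        m = 2 * math.sqrt(-p / 3)
        arg = max(-1.0, min(1.0, 3 * q / (p * m)))
        phi = math.acos(arg) / 3
        return [m * math.cos(phi - 2 * math.pi * k / 3) for k in range(3)]
    if disc == 0:                              # repeated root
        return [0.0] if p == 0 else [3 * q / p, -3 * q / (2 * p)]
    s = math.sqrt(-disc / 108)                 # one real root (Cardano)
    return [_cbrt(-q / 2 + s) + _cbrt(-q / 2 - s)]

def nearest_points(u1: float, u2: float, tol: float = 1e-9) -> list:
    """Points of the parabola whose distance to u is minimal, up to tol."""
    cands = [(t, t * t) for t in ed_critical_parameters(u1, u2)]
    dists = [math.hypot(x1 - u1, x2 - u2) for x1, x2 in cands]
    dmin = min(dists)
    return [c for c, d in zip(cands, dists) if d - dmin <= tol]

on_axis_high = nearest_points(0.0, 2.0)    # two nearest points: on the medial axis
on_axis_low = nearest_points(0.0, 0.25)    # below u2 = 1/2: unique nearest point
off_axis = nearest_points(0.3, 2.0)        # off the line u1 = 0: unique nearest point
```

Points on the line $u_1 = 0$ with $u_2 > \frac12$ have two nearest points, in line with $M_X = \{u_1 = 0\}$ and with the description of $\mathrm{Med}(X)$ in Example 7.3.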


Remark 7.5 Propositions 7.1 and 7.2 imply that the reach of a real variety is an algebraic
notion. More concretely, if $X$ is smooth and defined by polynomials with rational coefficients,
then its reach $\tau(X)$ is an algebraic number over $\mathbb{Q}$. This was shown by Horobeț and
Weinstein in [92, Proposition 3.14].


Our next goal is to characterize the reach of a smooth variety $X \subset \mathbb{R}^n$ in terms of maximal
curvature and bottlenecks. Curvature is discussed in detail in the previous chapter. We now
introduce bottlenecks. Let $x, y \in X$ be two distinct points. If $x - y$ is normal to $T_xX$ (i.e.,
$\langle t, x - y\rangle = 0$ for all $t \in T_xX$) and also normal to $T_yX$, then we call $\{x,y\}$ a bottleneck.
Complex solutions to the corresponding polynomial equations are complex bottlenecks.
Di Rocco, Eklund and Weinstein [57] expressed the number of complex bottlenecks of a
variety in terms of polar classes. The following theorem is their result for planar curves.


Theorem 7.6 A general plane curve of degree $d$ has $\frac{1}{2}(d^4 - 5d^2 + 4d)$ complex bottlenecks.


Example 7.7 We compute the bottlenecks of the Trott curve from (2.7) using the numerical
software HomotopyContinuation.jl [31]:

using HomotopyContinuation, LinearAlgebra 

@var x y u v 

f = 144*(x^4 + y^4) - 225*(x^2 + y^2) + 350*x^2*y^2 + 81; 

g = subs(f, x=>u, y=>v); 

df = differentiate(f, [x; y]); dg = differentiate(g, [u; v]); 

N = [x-u; y-v]; 

bottlenecks = solve([f; g; det([N df]); det([N dg])]) 

By Theorem 7.6 there are $\frac{1}{2}(d^4 - 5d^2 + 4d) = 96$ complex bottlenecks. Our computation
reveals that 36 of the bottlenecks are real. They are marked in Figure 7.1. We note here that
one point can appear in more than one bottleneck. $\diamond$
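The count in Theorem 7.6 is easy to tabulate. The snippet below is a small sketch (the function name `complex_bottlenecks` is a hypothetical helper); it reproduces the value 96 for quartics such as the Trott curve.

```python
def complex_bottlenecks(d: int) -> int:
    """Number of complex bottlenecks of a general plane curve of degree d,
    namely (d^4 - 5 d^2 + 4 d) / 2 as in Theorem 7.6."""
    return (d ** 4 - 5 * d ** 2 + 4 * d) // 2

# Degrees 1 through 5.
counts = {d: complex_bottlenecks(d) for d in range(1, 6)}
```

A line has no bottlenecks, and a general conic has two, which for an ellipse are its two axes.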


Fig. 7.1: Bottlenecks of the Trott curve (2.7) are displayed as grey normal lines with yellow endpoints.


The width of a bottleneck is $b(x,y) := \frac{1}{2}\|x - y\|$. We denote the width of the smallest
bottleneck of the variety $X$ by

$$B(X) \,:=\, \min_{\{x,y\} \text{ bottleneck of } X} b(x,y).$$


The next theorem links the reach of $X$ to its bottlenecks and its maximal curvature $C(X)$.
Recall that $C(X)$ was defined in Equation (6.11).


Theorem 7.8 Let $X$ be a smooth variety in $\mathbb{R}^n$. Then the reach of $X$ equals

$$\tau(X) \,=\, \min\Big\{ \frac{1}{C(X)},\, B(X) \Big\}.$$




Proof Recall from (6.13) that the reach $\tau(X)$ is the supremum over all $\varepsilon > 0$ such that the
exponential map $\varphi_\varepsilon : N_\varepsilon X \to \mathrm{Tube}(X,\varepsilon),\ (x,v) \mapsto x + v$ is a diffeomorphism. Let

$$\varepsilon := \tau(X). \qquad (7.1)$$

For $\varepsilon' > \varepsilon$, the exponential map $\varphi_{\varepsilon'}$ is not a diffeomorphism. The map is either not an
immersion or it is not injective. Thus, there is a point $u = x + v \in \mathbb{R}^n$, where $(x,v) \in NX$,
at distance $\varepsilon = \|v\|$ from $X$ such that, for $\varepsilon' > \varepsilon$, either the derivative of $\varphi_{\varepsilon'}$ at $(x,v)$ is not
injective or $\varphi_{\varepsilon'}^{-1}(u)$ has at least two elements.

Suppose first that the derivative of each $\varphi_{\varepsilon'}$ at $(x,v)$ is not injective. Then it follows
from (6.15) that $\varepsilon^{-1}$ is a principal curvature at $x$ in normal direction $-\varepsilon^{-1}v$. According to
(7.1), $\varepsilon$ is the smallest positive number with that property. Therefore, its inverse $\varepsilon^{-1}$ must
be the maximal curvature $C(X)$.

The remaining case to analyze is when each fiber $\varphi_{\varepsilon'}^{-1}(u)$ contains at least two points
and the derivative of $\varphi_{\varepsilon'}$ is injective at these points. We have two distinct points $x, y \in X$
such that $u = x + v = y + w$, where $(y,w) \in NX$ and $\delta := \|w\| \leq \varepsilon$. We distinguish two
subcases. First, we assume that $u$ lies on the line spanned by $x$ and $y$. Then, $\{x,y\}$ is a
bottleneck. Moreover, $u$ must be the midpoint between $x$ and $y$; otherwise there is a $\sigma$ with
$b(x,y) < \sigma < \varepsilon$ and the fiber of the midpoint $\frac{1}{2}(x+y)$ under $\varphi_\sigma$ would contain at least two
points, but the latter implies $\tau(X) \leq \sigma < \varepsilon$; a contradiction to (7.1). Hence, $\varepsilon = b(x,y)$.
By (7.1), there cannot be any smaller bottleneck, so that $\varepsilon = B(X)$ and we are done.

Second, we assume that $x, y, u$ form a triangle. Since the derivative of $\varphi_{\varepsilon'}$ is injective at
both $(x,v)$ and $(y,w)$, the Inverse Function Theorem implies the existence of two locally
defined and smooth maps $u \mapsto x(u)$ and $u \mapsto y(u)$ that project locally around $u$ to $X$.
These define two local smooth functions $d_x(u) := \|u - x(u)\|$ and $d_y(u) := \|u - y(u)\|$
that locally measure the distance to $X$. Their gradients are

$$\nabla d_x(u) = \varepsilon^{-1}(u - x) \quad \text{and} \quad \nabla d_y(u) = \delta^{-1}(u - y);$$

see e.g. [72, Lemma 2.11] and also Remark 7.9.
Let $a$ be a unit norm vector that starts at $u$ and points inside the triangle with vertices
$x, y, u$ such that $\langle u - x, a\rangle < 0$ and $\langle u - y, a\rangle < 0$. Here is an illustration:

[Illustration: the triangle with vertices $x$, $y$, $u$, and the vector $a$ pointing from $u$ into its interior.]

The partial derivatives of $d_x(u)$ and $d_y(u)$ in direction $a$ satisfy $\frac{\partial d_x(u)}{\partial a} < 0$ and $\frac{\partial d_y(u)}{\partial a} < 0$.
When we move from $u$ in direction $a$, the local distances from $u$ to $X$ both decrease. Hence,
there is a $\sigma < \varepsilon$ such that $\varphi_\sigma$ is not injective. Thus, $\tau(X) \leq \sigma < \varepsilon$. This contradicts (7.1)
and so $x, y, u$ cannot form a triangle. $\square$
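Theorem 7.8 reduces the computation of the reach to two numbers. The following sketch evaluates the minimum for three plane curves whose maximal curvature and narrowest bottleneck are classical; the function name `reach` and the convention $B(X) = \infty$ when there are no real bottlenecks are assumptions made for this illustration.

```python
import math

def reach(max_curvature: float, min_bottleneck_width: float = math.inf) -> float:
    """min(1/C(X), B(X)) as in Theorem 7.8.

    A curve without real bottlenecks is modeled by B(X) = infinity, and a
    curve with zero curvature by 1/C(X) = infinity.
    """
    inv_c = math.inf if max_curvature == 0 else 1 / max_curvature
    return min(inv_c, min_bottleneck_width)

# Parabola x2 = x1^2: curvature 2 at the apex, no real bottlenecks.
parabola = reach(2.0)

# Ellipse x^2/a^2 + y^2/b^2 = 1 with a > b: maximal curvature a/b^2 at (+-a, 0),
# and the narrowest bottleneck is the minor axis, of width b.
a, b = 2.0, 1.0
ellipse = reach(a / b ** 2, b)

# Circle of radius 3: curvature 1/3, and every diameter is a bottleneck of width 3.
circle = reach(1 / 3.0, 3.0)
```

For the ellipse the curvature term $b^2/a$ is the smaller one, while for the circle both terms coincide.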


Remark 7.9 As above, let $u \mapsto x(u)$ be a (locally defined) projection map to a variety $X$, and let
$d(u) := \|u - x(u)\|$ be the corresponding local distance function. Write $x = x(u)$ and
$v := (u - x)/\|u - x\|$ for the unit normal direction. We claim that $\nabla d(u) = v$.
An informal proof is as follows: If we move from $u$ infinitesimally in a direction that is
perpendicular to $v$, then the distance $d(u)$ does not change to first order. This means that the derivative
of $d(u)$ in a direction perpendicular to $v$ is zero, hence the gradient of $d(u)$ must be a
multiple of $v$. On the other hand, $d(u + t \cdot v) = d(u) + t$, which shows that the derivative of
$d(u)$ in direction $v$ is 1. Consequently, $\nabla d(u) = v$. In particular, the Weingarten map $L_v$
can be obtained via the second derivative of $d(u)$. This is worked out in (7.5) below.


7.2 Offset Hypersurfaces 


This section is based on the article [92] by Horobeț and Weinstein. We fix an irreducible
variety $X$ in $\mathbb{R}^n$, and we identify $X$ with its Zariski closure in $\mathbb{C}^n$. The ED correspondence
$\mathcal{E}_X$ of $X$ is the Zariski closure of the set of pairs $(x,u) \in \mathrm{Reg}(X) \times \mathbb{R}^n$ such that $x$ is an
ED critical point for $u$. We recall from Theorem 2.23 that

$$\mathcal{E}_X \,=\, \overline{\{(x, x+h) \mid x \in \mathrm{Reg}(X),\ (x,h) \in NX\}} \,\subseteq\, X \times \mathbb{C}^n.$$

The branch locus of the projection $\mathcal{E}_X \to \mathbb{C}^n$ is called the ED discriminant or evolute. For
plane curves, this coincides with the definition of the evolute in Section 1.3. We denote the
ED discriminant of the given variety $X$ by $\Sigma_X \subset \mathbb{C}^n$.

For $\varepsilon \in \mathbb{C}$ and $u \in \mathbb{C}^n$, the $\varepsilon$-sphere around $u$ is the variety $S(u,\varepsilon) := V(\|x - u\|^2 - \varepsilon^2)$.


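Definition 7.10 The offset correspondence of $X$ is the variety

$$\mathcal{OC}_X \,:=\, (\mathcal{E}_X \times \mathbb{C}) \,\cap\, \{(x,u,\varepsilon) \in \mathbb{C}^n \times \mathbb{C}^n \times \mathbb{C} \mid x \in S(u,\varepsilon)\}.$$

Hence, $\mathcal{OC}_X$ is the complex Zariski closure of the set of triples $(x,u,\varepsilon)$ such that $x$ is an
ED critical point for $u$, and $\varepsilon^2$ is the squared Euclidean distance (over $\mathbb{R}$) between $x$ and $u$.


We consider the two projections $\pi_1 : \mathcal{OC}_X \to X$ and $\pi_2 : \mathcal{OC}_X \to \mathbb{C}^n \times \mathbb{C}$. The map $\pi_1$
is dominant, i.e. $\overline{\pi_1(\mathcal{OC}_X)} = X$, because $(x,x,0) \in \mathcal{OC}_X$ for all $x \in X$. However, the other
projection is not dominant. The next definition specifies the image of that projection.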


Definition 7.11 The offset hypersurface of $X$ is the variety $\mathrm{Off}_X := \overline{\pi_2(\mathcal{OC}_X)} \subset \mathbb{C}^n \times \mathbb{C}$.

The next lemma justifies the name.

Lemma 7.12 $\mathrm{codim}\, \mathrm{Off}_X = 1$.


Proof The ED correspondence $\mathcal{E}_X$ is the Zariski closure of a vector bundle of rank
$\mathrm{codim}(X)$ over $\mathrm{Reg}(X)$, which shows that $\dim \mathcal{E}_X = n$. Since $X$ is irreducible, also $\mathcal{E}_X$
is irreducible. The offset correspondence $\mathcal{OC}_X$ is the intersection of $\mathcal{E}_X \times \mathbb{C}$, which is
irreducible as well, with a hypersurface. This implies $\dim \mathcal{OC}_X = n$. Since the ED degree
of $X$ is finite, the projection $\pi_2$ has finite fibers generically, which implies $\dim \mathrm{Off}_X = n$. $\square$


Remark 7.13 For a fixed radius $r > 0$, the level set $\mathrm{Off}_{X,r} \subset \mathbb{C}^n$ is the intersection of the
offset hypersurface $\mathrm{Off}_X$ with the hyperplane $\varepsilon = r$ in $\mathbb{C}^n \times \mathbb{C}$. Figure 7.3 shows the level
sets for $r = 0.5$ and $r = 1.25$, respectively, of the offset surface of a parabola. The real locus
of $\mathrm{Off}_{X,r}$ always contains the boundary of the tubular neighborhood $\mathrm{Tube}(X,r)$, which
is $\partial\mathrm{Tube}(X,r) = \{u \in \mathbb{R}^n \mid d(u,X) = r\}$. Indeed, for any $u \in \partial\mathrm{Tube}(X,r)$, there is a
closest point $x \in X$ with $\|x - u\| = r$. In particular, $x$ is an ED critical point for $u$ and
$(x,u,r) \in \mathcal{OC}_X$, i.e., $u \in \mathrm{Off}_{X,r}$. For the parabola in Figure 7.3, the real locus of $\mathrm{Off}_{X,0.5}$
is equal to $\partial\mathrm{Tube}(X, 0.5)$. On the other hand, $\partial\mathrm{Tube}(X, 1.25)$ is strictly contained in the
real part of $\mathrm{Off}_{X,1.25}$. In this example, that change of qualitative behavior occurs when the
radius $r$ crosses the reach $\tau(X) = 0.5$.


It follows from Lemma 7.12 that $\mathrm{Off}_X$ is the zero set of a polynomial $g_X(u,\varepsilon)$; i.e.,

$$\mathrm{Off}_X \,=\, V(g_X) \,\subset\, \mathbb{C}^n \times \mathbb{C}.$$

We call $g_X$ the offset polynomial. It is also known as the ED polynomial; see [141].


Fig. 7.2: The offset hypersurface $\mathrm{Off}_X$ of the parabola is a surface in $\mathbb{R}^3$. The surface is symmetric along
the $\varepsilon$-axis, because only even powers of $\varepsilon$ appear in the offset polynomial $g_X(u,\varepsilon)$. The parabola itself is
visible at level $\varepsilon = 0$.


Example 7.14 We compute the offset polynomial of a parabola in Macaulay2 [73]:

QQ[x1, x2, u1, u2, eps];
-- the parabola and the sphere of radius eps around u
f = x2 - x1^2; d = (x1-u1)^2 + (x2-u2)^2 - eps^2;
-- x - u is normal to the parabola at x
J = matrix {{x1-u1, x2-u2}, {diff(x1, f), diff(x2, f)}};
OC = ideal {f, det(J), d};
-- project the offset correspondence to the (u, eps)-space
O = eliminate({x1, x2}, OC);
g = sub((gens O)_0_0, QQ[x1, x2, u1, u2][eps])


Setting $X = V(x_2 - x_1^2)$, this gives $g_X(u,\varepsilon) = g_0(u) + g_1(u)\varepsilon^2 + g_2(u)\varepsilon^4 + g_3(u)\varepsilon^6$, where

$$\begin{aligned}
g_0(u) &= (u_1^2 - u_2)^2\,(16u_1^2 + 16u_2^2 - 8u_2 + 1), \\
g_1(u) &= -48u_1^4 - 32u_1^2u_2^2 + 8u_1^2u_2 - 32u_2^3 - 20u_1^2 - 8u_2^2 + 8u_2 - 1, \\
g_2(u) &= 48u_1^2 + 16u_2^2 + 32u_2 - 8, \\
g_3(u) &= -16.
\end{aligned}$$

Figure 7.2 shows the real part of the offset surface $g_X(u,\varepsilon) = 0$, which lives in the ambient
space $\mathbb{R}^3 = \mathbb{R}^2 \times \mathbb{R} \subset \mathbb{C}^2 \times \mathbb{C}$. $\diamond$
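The four coefficients can be verified independently of the Gröbner basis computation. Every point of the form $u = x + h\cdot(-2t, 1)$ with $x = (t,t^2)$ lies on the normal line of the parabola at $x$, so $x$ is an ED critical point for $u$ with squared distance $h^2(4t^2+1)$, and the offset polynomial must vanish there. The sketch below checks this in exact rational arithmetic; the function names are hypothetical, and the polynomial is written in the variable $E = \varepsilon^2$.

```python
from fractions import Fraction as Q

def offset_poly(u1, u2, E):
    """Offset polynomial of the parabola x2 = x1^2, as a cubic in E = eps^2."""
    g0 = (u1**2 - u2)**2 * (16*u1**2 + 16*u2**2 - 8*u2 + 1)
    g1 = (-48*u1**4 - 32*u1**2*u2**2 + 8*u1**2*u2 - 32*u2**3
          - 20*u1**2 - 8*u2**2 + 8*u2 - 1)
    g2 = 48*u1**2 + 16*u2**2 + 32*u2 - 8
    g3 = -16
    return g0 + g1*E + g2*E**2 + g3*E**3

def point_on_normal_line(t, h):
    """Return (u1, u2, E) for u = x + h*(-2t, 1), x = (t, t^2), E = |u - x|^2."""
    t, h = Q(t), Q(h)
    return t - 2*t*h, t*t + h, h*h*(4*t*t + 1)

# Values of the offset polynomial at several points on normal lines.
samples = [(1, 1), (2, 1), (Q(1, 2), 3), (-3, Q(1, 4)), (0, 2)]
values = [offset_poly(*point_on_normal_line(t, h)) for t, h in samples]

# For u = (0, 2) the three roots in E are 4 and 7/4 (a double root).
axis_values = [offset_poly(Q(0), Q(2), E) for E in (Q(4), Q(7, 4))]
```

The double root at $u = (0,2)$ reflects the fact that this point lies on the medial axis: two distinct critical points have the same distance to $u$.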

Fig. 7.3: The offset surface $\mathrm{Off}_X$ of the parabola intersected with the planes $\varepsilon = 1.25$ and $\varepsilon = 0.5$.


We next examine some properties of the offset polynomial. 


Proposition 7.15 (a) For a general point $u \in \mathbb{C}^n$, the zeros of the univariate polynomial
function $\varepsilon \mapsto g_X(u,\varepsilon)$ are the complex numbers $\varepsilon = \pm\sqrt{\|u - x\|^2}$, where $x$ ranges
over all ED critical points for $u$ on $X$.

(b) The degree of the offset polynomial $g_X(u,\varepsilon)$ in the unknown $\varepsilon$ is two times the Euclidean
distance degree of $X$.


Proof We observe that the projection $\mathrm{Off}_X \to \mathbb{C}^n,\ (u,\varepsilon) \mapsto u$ is dominant, because general
points in $\mathbb{C}^n$ have ED critical points on $X$. Take a general $u \in \mathbb{C}^n$. Then, $g_X(u,\varepsilon) = 0$ if and
only if there exists $x \in X$ with $(x,u) \in \mathcal{E}_X$ and $\varepsilon^2 = \|x - u\|^2$. Therefore, $\varepsilon = \pm\sqrt{\|u - x\|^2}$,
where $x$ ranges over all ED critical points for $u$. In particular, this shows that the univariate
polynomial function $\varepsilon \mapsto g_X(u,\varepsilon)$ has $2 \cdot \mathrm{EDdegree}(X)$ many zeros for general $u$. $\square$


Example 7.16 In Example 7.14, we see that the offset polynomial $g_X(u,\varepsilon)$ has degree six
in $\varepsilon$. This reflects the fact that the ED degree of the parabola is three. For instance, any
point $u \in \mathbb{R}^2$ in Figure 7.4 that is both above the blue curve (the parabola) and above the
yellow curve (the evolute) has three real ED critical points on the parabola. $\diamond$


The coefficients of the offset polynomial were studied by Ottaviani and Sodomaco 
in [141]. They prove the following theorem (see [141, Proposition 4.4]). 


Theorem 7.17 If $X$ is general enough and $g_X(u,\varepsilon) = c_0(u) + c_1(u)\varepsilon^2 + \cdots + c_k(u)\varepsilon^{2k}$ is
its offset polynomial, then the leading coefficient $c_k(u)$ is a real constant.


7.3 Offset Discriminant


In the previous section, we introduced the offset polynomial $g_X(u,\varepsilon)$ of a variety $X \subset \mathbb{R}^n$.
For any fixed $u \in \mathbb{R}^n$, this encodes the distances from $u$ to its ED critical points on $X$. If $u$
is on the medial axis, then $g_X(u,\varepsilon)$ must have a double root in $\varepsilon$. This motivates us to study
the discriminant of the offset polynomial with respect to the distinguished variable $\varepsilon$. That
discriminant is a polynomial in $u = (u_1,\ldots,u_n)$.


Definition 7.18 The offset discriminant is the polynomial

$$\delta_X(u) \,:=\, \mathrm{Disc}_\varepsilon\, g_X(u,\varepsilon).$$

Its zero set is the hypersurface

$$\Delta_X^{\mathrm{Off}} \,=\, V(\delta_X) \,\subset\, \mathbb{C}^n.$$


Fig. 7.4: The offset discriminant of the parabola $X = V(x_2 - x_1^2)$ has three real components: the parabola
itself (blue), the algebraic medial axis $M_X$ (the red vertical line) and the ED discriminant or evolute $\Sigma_X$
(the yellow cubic curve).


Example 7.19 The discriminant of the offset polynomial of the parabola in Example 7.14 is, up to a nonzero constant factor,

$$\delta_X(u) \,=\, u_1^4\,(u_1^2 - u_2)^2 \cdot \delta_1(u)^6 \cdot \delta_2(u),$$

where $\delta_1(u) = 27u_1^2 - 2(2u_2 - 1)^3$ and $\delta_2(u) = 16u_1^2 + (4u_2 - 1)^2$.

The factor $\delta_2(u)$ has only one real zero, the focus $(0, \frac{1}{4})$ of the parabola, which lies on the line
$u_1 = 0$, and $u_1^2 - u_2$ is the polynomial of $X$. The real zero loci
of the other factors $u_1$ and $\delta_1(u)$ are shown in Figure 7.4. The medial axis of the parabola
is $\mathrm{Med}(X) = \{(0,u_2) \mid u_2 > \frac{1}{2}\}$ and $u_1 = 0$ is the Zariski closure of $\mathrm{Med}(X)$. The variety
$\delta_1(u) = 0$ is the ED discriminant or evolute of the parabola, also known as the semicubical
parabola. Above the evolute, every point $u \in \mathbb{R}^2$ has three real ED critical points on $X$,
and below the evolute it has one real and two complex ED critical points. $\diamond$
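The geometric meaning of the factors can be checked directly. The center of curvature of the parabola at $(t,t^2)$ is $(-4t^3,\, 3t^2 + \frac12)$, and these points trace out the evolute. The sketch below assumes the normalization $\delta_1(u) = 27u_1^2 - 2(2u_2-1)^3$ and $\delta_2(u) = 16u_1^2 + (4u_2-1)^2$ and verifies, in exact arithmetic, that $\delta_1$ vanishes on the evolute; the function names are hypothetical.

```python
from fractions import Fraction as Q

def delta1(u1, u2):
    """Equation of the evolute (ED discriminant) of the parabola x2 = x1^2."""
    return 27*u1**2 - 2*(2*u2 - 1)**3

def delta2(u1, u2):
    """The factor of the offset discriminant coming from g_0(u)."""
    return 16*u1**2 + (4*u2 - 1)**2

def center_of_curvature(t):
    """Center of the osculating circle of the parabola at (t, t^2)."""
    t = Q(t)
    return -4*t**3, 3*t**2 + Q(1, 2)

# delta1 at several centers of curvature, including the cusp (t = 0).
evolute_values = [delta1(*center_of_curvature(t)) for t in (0, 1, -2, Q(3, 7))]

# Value at the focus (0, 1/4) and at a nearby real point.
focus_value = delta2(Q(0), Q(1, 4))
nearby_value = delta2(Q(1, 10), Q(1, 4))
```

The cusp of the evolute is the point $(0, \frac12)$, at distance $\tau(X) = \frac12$ from the apex of the parabola.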


The discriminant $\Delta_X^{\mathrm{Off}}$ in the previous example has three irreducible components: the
variety $X$, its algebraic medial axis, and its evolute. We show that this is a general fact,
following [92]. For that, we define the bisector hypersurface $\mathrm{Bis}_X$ of the variety $X$. If we
write $\mathrm{Bl}_X \subset \mathbb{C}^n \times \mathbb{C}$ for the branch locus of the projection $\pi_2 : \mathcal{OC}_X \to \mathbb{C}^n \times \mathbb{C}$, the bisector
hypersurface is the set of all points $u$ that arise from that branch locus:

$$\mathrm{Bis}_X \,:=\, \{u \mid (u,\varepsilon) \in \mathrm{Bl}_X\}.$$

Theorem 7.20 The offset discriminant has the following decomposition:

$$\Delta_X^{\mathrm{Off}} \,=\, \mathrm{Bis}_X \cup \Sigma_X \,\supseteq\, X \cup M_X \cup \Sigma_X.$$

The real locus of $\Delta_X^{\mathrm{Off}}$ consists of the real points of the varieties $X$, $M_X$, and $\Sigma_X$.


Proof By Proposition 7.15, the offset discriminant is the locus of those $u$ such that
$g_X(u,\varepsilon) \in \mathbb{C}[\varepsilon]$ has fewer than $2 \cdot \mathrm{EDdegree}(X)$ distinct complex zeros. This can happen
for two reasons: either $u$ has fewer than $\mathrm{EDdegree}(X)$ distinct ED critical points on $X$, or $u$
has two distinct ED critical points

$$x_1 \neq x_2 \quad \text{with} \quad \|x_1 - u\|^2 = \|x_2 - u\|^2. \qquad (7.2)$$

The first case is accounted for by the ED discriminant $\Sigma_X$ and the second case is the bisector
hypersurface $\mathrm{Bis}_X$. By definition, the medial axis $\mathrm{Med}(X)$ is contained in $\mathrm{Bis}_X$, and thus
we also have the inclusion $M_X \subseteq \mathrm{Bis}_X$. Since the $\varepsilon$ that come from zeros of $g_X(u,\varepsilon)$
come in signed pairs (cf. Proposition 7.15), we see that $X \times \{0\}$ is doubly covered by the
projection $\pi_2$. This implies $X \subseteq \mathrm{Bis}_X$. All other components of $\mathrm{Bis}_X$ that are not in $X \cup M_X$
consist of non-real points $u$ that have ED critical points as in (7.2). $\square$


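Suppose now that $u_0 \in \mathbb{R}^n \setminus \Delta_X^{\mathrm{Off}}$ is a real point outside the offset discriminant. Using
Remark 7.9, we can compute a unit normal field from the offset polynomial by working
locally near $u_0$. Namely, we first choose $\varepsilon_0$ such that $(u_0,\varepsilon_0) \in \mathbb{R}^n \times \mathbb{R}$ is a real zero of $g_X$.
Then, by varying $u$ in a small neighborhood of $u_0$, we obtain a function $u \mapsto \varepsilon(u)$. This
function is defined implicitly by the equation $g_X(u,\varepsilon) = 0$.

We consider the gradient $\frac{\partial \varepsilon}{\partial u}(u,\varepsilon)$ of the function $u \mapsto \varepsilon(u)$. It follows from Remark 7.9
that this gradient is a unit normal vector which points from a real ED critical point on $X$
towards $u$. We compute that gradient by implicit differentiation. Namely, by differentiating
the equation $g_X(u,\varepsilon) = 0$, we obtain

$$\nabla\varepsilon \,:=\, \frac{\partial \varepsilon}{\partial u}(u,\varepsilon) \,=\, -\Big(\frac{\partial g_X}{\partial \varepsilon}\Big)^{-1} \cdot \frac{\partial g_X}{\partial u} \,\in\, \mathbb{R}^n. \qquad (7.3)$$

Furthermore, by differentiating (7.3), we obtain the following formula for the Hessian
matrix of $u \mapsto \varepsilon(u)$:

$$\frac{\partial^2 \varepsilon}{\partial u^2} \,=\, -\Big(\frac{\partial g_X}{\partial \varepsilon}\Big)^{-1} \Big( \frac{\partial^2 g_X}{\partial u^2} + \frac{\partial^2 g_X}{\partial \varepsilon^2}\, \nabla\varepsilon\, (\nabla\varepsilon)^T + \nabla\varepsilon \Big(\frac{\partial^2 g_X}{\partial u\, \partial \varepsilon}\Big)^T + \frac{\partial^2 g_X}{\partial u\, \partial \varepsilon}\, (\nabla\varepsilon)^T \Big). \qquad (7.4)$$

The expression in (7.4) is an $n \times n$ matrix. The entries of the gradient vector and the Hessian
matrix are rational functions in $u$ and $\varepsilon$, whose denominator vanishes when $\frac{\partial g_X}{\partial \varepsilon}(u,\varepsilon) = 0$. This
is why we assumed $u_0 \notin \Delta_X^{\mathrm{Off}}$. The next theorem shows that we can compute the second
fundamental form of $X$ from the Hessian of $g_X$.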


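Theorem 7.21 Let $u \in \mathbb{R}^n \setminus \Delta_X^{\mathrm{Off}}$ and $\varepsilon > 0$ with $g_X(u,\varepsilon) = 0$. Let $x \in X$ be the (uniquely
determined) ED critical point corresponding to $(u,\varepsilon)$. The following two statements hold:

(a) The gradient $\frac{\partial \varepsilon}{\partial u}(u,\varepsilon)$ is a unit normal vector at $x$ pointing towards $u$.

(b) Let $t \in T_xX$ be a tangent vector. The second fundamental form of $X$ evaluated at $t$ is

$$\mathrm{II}_{u-x}(t) \,=\, \lim_{s \to 0}\, t^T \Big[ \frac{\partial^2 \varepsilon}{\partial u^2}\big(x + s(u-x),\, s\varepsilon\big) \Big]\, t.$$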


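Proof Item (a) follows from Remark 7.9 and we have discussed this above. We need to
prove item (b). Since $u \notin \Delta_X^{\mathrm{Off}}$, there can only be finitely many points on the line through $x$
and $u$ that belong to $\Delta_X^{\mathrm{Off}}$. This implies that there exists $s' > 0$ such that $x + s(u-x) \notin \Delta_X^{\mathrm{Off}}$
for all $0 < s < s'$. Item (a) implies that, for all $0 < s < s'$, the vector $\frac{\partial \varepsilon}{\partial u}(x + s(u-x), s\varepsilon)$
is a unit normal vector at $x$ pointing towards $u$.

Fix $0 < s < s'$. By definition of the second fundamental form in (6.9), we know that
$\mathrm{II}_{u-x}(t)$ is the directional derivative of the unit normal field above in direction $t$, i.e.,

$$\mathrm{II}_{u-x}(t) \,=\, t^T \Big[ \frac{\partial}{\partial x} \frac{\partial \varepsilon}{\partial u}\big(x + s(u-x),\, s\varepsilon\big) \Big]\, t \quad \text{for all } 0 < s < s'. \qquad (7.5)$$

Let $L_v$ denote the Weingarten map at $x$ in normal direction $v = (u-x)/\|u-x\|$. By (6.15),

$$\frac{\partial u}{\partial x} \,=\, I_n + \varepsilon \cdot \begin{pmatrix} L_v & 0 \\ 0 & 0 \end{pmatrix}.$$

By applying the chain rule, we find that

$$\frac{\partial}{\partial x} \frac{\partial \varepsilon}{\partial u} \,=\, \frac{\partial^2 \varepsilon}{\partial u^2}\, \frac{\partial u}{\partial x} \,=\, \frac{\partial^2 \varepsilon}{\partial u^2} \Big( I_n + \varepsilon \cdot \begin{pmatrix} L_v & 0 \\ 0 & 0 \end{pmatrix} \Big).$$

Evaluating this equation at $(x + s(u-x), s\varepsilon)$, we obtain

$$\frac{\partial^2 \varepsilon}{\partial u^2}\big(x + s(u-x),\, s\varepsilon\big) \,=\, \frac{\partial}{\partial x} \frac{\partial \varepsilon}{\partial u}\big(x + s(u-x),\, s\varepsilon\big) \,-\, s\varepsilon \cdot \Big( \frac{\partial^2 \varepsilon}{\partial u^2} \begin{pmatrix} L_v & 0 \\ 0 & 0 \end{pmatrix} \Big)\big(x + s(u-x),\, s\varepsilon\big).$$

If we multiply both sides of this equation from the left by $t^T$ and from the right by $t$ and then
take the limit $s \to 0$, the formula (7.5) implies that we obtain $\mathrm{II}_{u-x}(t)$ on the right-hand
side. This proves item (b). $\square$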

Example 7.22 We compute the expression (7.3) for the parabola $X = V(x_2 - x_1^2)$. Using
the offset polynomial $g_X(u,\varepsilon)$ in Example 7.14, we find $\frac{\partial \varepsilon}{\partial u}(u,\varepsilon) = \frac{1}{p}(h_1, h_2)$, where

$$\begin{aligned}
h_1 \,=\;& -96u_1\varepsilon^4 + (192u_1^3 + 64u_1u_2^2 - 16u_1u_2 + 40u_1)\,\varepsilon^2 \\
& - 4u_1(u_1^2 - u_2)(24u_1^2 + 16u_2^2 - 16u_2 + 1), \\
h_2 \,=\;& (-32u_2 - 32)\,\varepsilon^4 + (64u_1^2u_2 - 8u_1^2 + 96u_2^2 + 16u_2 - 8)\,\varepsilon^2 \\
& - 2(u_1^2 - u_2)(16u_1^2u_2 - 20u_1^2 - 32u_2^2 + 12u_2 - 1), \\
p \,=\;& -96\varepsilon^5 + (192u_1^2 + 64u_2^2 + 128u_2 - 32)\,\varepsilon^3 \\
& + (-96u_1^4 - 64u_1^2u_2^2 + 16u_1^2u_2 - 64u_2^3 - 40u_1^2 - 16u_2^2 + 16u_2 - 2)\,\varepsilon.
\end{aligned}$$

For instance, if we plug in $(u_1,u_2,\varepsilon) = (0, \frac{1}{4}, \frac{1}{4})$, we obtain $\frac{1}{p}(h_1,h_2) = (0,1)$, which is the
unit normal vector on the parabola at $x = (0,0)$ pointing towards $u = (0, \frac{1}{4})$.

Now we compute the Hessian matrix of $\varepsilon(u)$ using the formula in (7.4). This expression
is very large and this is why we chose not to display it. Instead, we evaluate it directly at
$(u_1,u_2,\varepsilon) = (0,s,s)$, where $s > 0$, and thereafter we let $s \to 0$. This yields the matrix

$$A \,=\, \begin{pmatrix} -2 & 0 \\ 0 & 0 \end{pmatrix}.$$

We see from Theorem 7.21 (b) that the (signed) curvature of the parabola at $x = (0,0)$ is
$tAt^T = -2$, where $t = (1,0)$ is the tangent direction of $X$ at $x$. Here, the negative sign
arises because the normal field points “inwards”, which causes the derivative of the normal
field to point in the opposite direction; this can be seen in Figure 6.2.

It was shown in Example 7.3 that the reach of $X$ is $\frac{1}{2}$. This confirms Theorem 7.8, which
here states that the reach of the parabola is the inverse of its maximal curvature $C(X)$.
Indeed, a parabola has no real bottlenecks, and its curvature is maximal at the apex. $\diamond$
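The rational expressions of this example can be evaluated numerically as a plausibility check. The sketch below takes $h_1$, $h_2$ and $p$ to be $-\partial g_X/\partial u_1$, $-\partial g_X/\partial u_2$ and $\partial g_X/\partial \varepsilon$ for the parabola, in accordance with (7.3). It first tests the gradient at $u = (-1,2)$, whose critical point $x = (1,1)$ has $u - x = (-2,1)$ and $\varepsilon = \sqrt{5}$. It then approximates the Hessian entry $A_{11}$ on the axis by a difference quotient that keeps $\varepsilon = s$ fixed. This shortcut introduces an error that is of higher order in the step size and is an assumption of the sketch, not the exact formula (7.4). The name `grad_parts` is hypothetical.

```python
import math
from fractions import Fraction as Q

def grad_parts(u1, u2, e):
    """Return (h1, h2, p) with grad(eps) = (h1, h2) / p for the parabola."""
    h1 = (-96*u1*e**4
          + (192*u1**3 + 64*u1*u2**2 - 16*u1*u2 + 40*u1) * e**2
          - 4*u1*(u1**2 - u2)*(24*u1**2 + 16*u2**2 - 16*u2 + 1))
    h2 = (-(32*u2 + 32)*e**4
          + (64*u1**2*u2 - 8*u1**2 + 96*u2**2 + 16*u2 - 8) * e**2
          - 2*(u1**2 - u2)*(16*u1**2*u2 - 20*u1**2 - 32*u2**2 + 12*u2 - 1))
    p = (-96*e**5
         + (192*u1**2 + 64*u2**2 + 128*u2 - 32) * e**3
         + (-96*u1**4 - 64*u1**2*u2**2 + 16*u1**2*u2 - 64*u2**3
            - 40*u1**2 - 16*u2**2 + 16*u2 - 2) * e)
    return h1, h2, p

# Generic point: x = (1, 1), u = x + (-2, 1) = (-1, 2), eps = sqrt(5).
h1, h2, p = grad_parts(-1.0, 2.0, math.sqrt(5.0))
grad = (h1 / p, h2 / p)

# On the axis, u = (0, s) and eps = s: the gradient is (0, 1) in exact arithmetic.
s = Q(1, 4)
h1_axis, h2_axis, p_axis = grad_parts(Q(0), s, s)

# Difference quotient for A_11 at u = (0, s) with s = 1/10 and step d.
s, d = Q(1, 10), Q(1, 10**6)
h1_d, _, p_d = grad_parts(d, s, s)
a11 = float(h1_d / p_d / d)
```

At $s = \frac{1}{10}$ the difference quotient is close to $-2/(1-2s) = -2.5$, and this expression tends to $-2$ as $s \to 0$, in agreement with the matrix $A$.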
Chapter 8
Voronoi Cells


Every real algebraic variety $X$ determines a Voronoi decomposition of its ambient Euclidean
space $\mathbb{R}^n$. This is a partition of $\mathbb{R}^n$ into Voronoi cells, one for each point in $X$. The Voronoi
cell of $y \in X$ is the set of points in $\mathbb{R}^n$ whose nearest point on $X$ is $y$. Hence, $u$ is in the
Voronoi cell of $y$ if $d(u,X) = d(u,y)$. The Voronoi cell is a convex, semialgebraic set in
the normal space of $X$ at $y$. Most readers are familiar with the case when $X$ is a finite set.
In this chapter, we study Voronoi cells of varieties, with a primary focus on their algebraic
boundaries. We consider them only for the Euclidean metric, but it also makes much sense to
study Voronoi cells under the Wasserstein distance from Chapter 5 [16] or Kullback–Leibler
divergence from Chapter 11 [5]. As before, $\langle \cdot\,, \cdot \rangle$ denotes the Euclidean inner product.


8.1 Voronoi Basics 


Let $X$ be a finite subset of $\mathbb{R}^n$. The Voronoi cell of $y \in X$ collects all points whose closest
point in $X$ is $y$. Writing $d(u,x) = \|u - x\|$ for the Euclidean distance, the Voronoi cell equals

$$\mathrm{Vor}_X(y) \,:=\, \{u \in \mathbb{R}^n \mid y \in \arg\min_{x \in X} d(u,x)\}. \qquad (8.1)$$

The study of these cells, and how they depend on the configuration $X$, is ubiquitous in
computational geometry and its numerous applications. We begin with the fact that the
Voronoi cell is a (possibly unbounded) convex polyhedron with at most $|X| - 1$ facets.


Proposition 8.1 The Voronoi cell of a point $y$ in the finite set $X \subset \mathbb{R}^n$ is the polyhedron

$$\mathrm{Vor}_X(y) \,=\, \{u \in \mathbb{R}^n \mid \langle u, x - y\rangle \leq \tfrac{1}{2}(\|x\|^2 - \|y\|^2) \text{ for all } x \in X \setminus \{y\}\}. \qquad (8.2)$$


Proof By definition, the Voronoi cell $\mathrm{Vor}_X(y)$ consists of all points $u$ such that $d(u,x) \geq
d(u,y)$ for all $x \in X \setminus \{y\}$. This is equivalent to $\|u - x\|^2 - \|u - y\|^2$ being nonnegative. But,
this expression is equal to $\|x\|^2 - \|y\|^2 - 2\langle u, x - y\rangle$. The main point is that the quadratic
term drops out, so the expression is linear in $u$. $\square$
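The equivalence used in this proof can be tested on a small configuration. The sketch below compares the linear inequalities (8.2) with the direct definition (8.1) on a grid of integer points, so that all comparisons are exact; the function names and the sample configuration are assumptions made for illustration.

```python
def in_voronoi_cell(u, y, points):
    """Linear description (8.2): 2<u, x - y> <= |x|^2 - |y|^2 for all x != y."""
    dot = lambda a, b: sum(ai * bi for ai, bi in zip(a, b))
    return all(
        2 * dot(u, [xi - yi for xi, yi in zip(x, y)]) <= dot(x, x) - dot(y, y)
        for x in points if x != y
    )

def is_nearest(u, y, points):
    """Definition (8.1): y minimizes the squared distance to u over the set."""
    d2 = lambda a: sum((ai - ui) ** 2 for ai, ui in zip(a, u))
    return all(d2(y) <= d2(x) for x in points)

# A finite set X in the plane and a grid of test points u.
X = [(0, 0), (2, 0), (0, 2), (3, 3)]
grid = [(i, j) for i in range(-3, 7) for j in range(-3, 7)]

# True if both descriptions agree for every test point and every y in X.
agree = all(in_voronoi_cell(u, y, X) == is_nearest(u, y, X)
            for u in grid for y in X)
```

Each cell is cut out by at most $|X| - 1 = 3$ half-planes here, one for each competitor $x \neq y$.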


The collection of Voronoi cells, as $y$ ranges over the set $X$, is also known as the Voronoi
diagram of $X$. This is a polyhedral subdivision of $\mathbb{R}^n$ into finitely many convex cells.

We now shift gears, and we replace the finite set $X$ by a real algebraic variety of positive
dimension. Thus, let $X$ be a real algebraic variety of codimension $c$ in $\mathbb{R}^n$, and consider a
point $y \in X$. The Voronoi cell $\mathrm{Vor}_X(y)$ is defined as before. It consists of all points $u$ in $\mathbb{R}^n$
such that $y$ is closer or equal to $u$ than any other point $x \in X$. Equation (8.2) still holds, and
we conclude that $\mathrm{Vor}_X(y)$ is a convex set. We seek the Voronoi diagram $\{\mathrm{Vor}_X(y)\}_{y \in X}$ in
$\mathbb{R}^n$ where $y$ runs over all (infinitely many) points in the variety $X$.


Proposition 8.2 If $y$ is a smooth point of the variety $X$, then its Voronoi cell $\mathrm{Vor}_X(y)$ is a
convex semialgebraic full-dimensional subset of the $c$-dimensional affine normal space

$$N_X(y) \,:=\, y + N_yX \,=\, \{u \in \mathbb{R}^n \mid u - y \text{ is perpendicular to the tangent space of } X \text{ at } y\}.$$


Proof Fix $u \in \mathrm{Vor}_X(y)$. Consider any point $x$ in $X$ that is close to $y$, and set $v = x - y$. The
inequality in (8.2) implies $\langle u, v\rangle \leq \frac{1}{2}(\|y + v\|^2 - \|y\|^2) = \langle y, v\rangle + \frac{1}{2}\|v\|^2$. For any $w$ in
the tangent space of $X$ at $y$, there exists $v = \varepsilon w + O(\varepsilon^2)$ such that $x = y + v$ is in $X$.

After dividing by $\varepsilon$ and letting $\varepsilon \to 0$, the inequality above yields $\langle u, w\rangle \leq \langle y, w\rangle$, and the same with $-w$ instead of $w$. Then
$\langle u - y, w\rangle = 0$, and hence $u$ is in the normal space $N_X(y)$. We already argued that $\mathrm{Vor}_X(y)$ is
convex. It is semialgebraic, by Tarski’s Theorem on Quantifier Elimination. This allows us
to eliminate $x$ from the formula (8.2). Finally, the Voronoi cell $\mathrm{Vor}_X(y)$ is full-dimensional
in the $c$-dimensional space $N_X(y)$ because every point $u$ in an $\varepsilon$-neighborhood of $y$ has a
unique closest point in $X$. Moreover, if $u \in N_X(y)$ and $\|u - y\| < \varepsilon$, then $y$ must be the point
in $X$ that is closest to $u$. $\square$


One approach to understanding Voronoi cells of a variety $X$ is to take a large but finite
sample from $X$ and consider the Voronoi diagram of that sample. This is a finite approximation
to the desired limit object. By taking finer and finer samples, the Voronoi diagram
should converge nicely to a subdivision with infinitely many lower-dimensional regions,
namely the Voronoi cells $\mathrm{Vor}_X(y)$. This process was studied by Brandt and Weinstein
in [25] for the case when $n = 2$ and $X$ is a curve. An illustration, similar to [25, Figure 1],
is shown in Figure 8.1. Note that, for $n \geq 3$, the Voronoi cells are generally not polyhedra.

Theorem 1 in [25] states that, under some mild hypotheses, the Voronoi cells of the
samples in this process do indeed converge to the Voronoi cells $\mathrm{Vor}_X(y)$. The authors posted
a delightful YouTube video, called Mathemaddies’ Ice Cream Map. Please do watch that
movie! Their curve $X$ is the shoreline that separates the city of Berkeley from the San
Francisco Bay. One hopes to find many ice cream shops at the shore.


The topological boundary of the Voronoi cell $\mathrm{Vor}_X(y)$ in the normal space $N_X(y)$ is
denoted by $\partial\mathrm{Vor}_X(y)$. It consists of all points in $N_X(y)$ that have at least two closest points
in $X$, including $y$. Note that $\partial\mathrm{Vor}_X(y)$ is contained in the intersection of $N_X(y)$ with the
medial axis $\mathrm{Med}(X)$ that was introduced in Chapter 7. We are interested in the algebraic
boundary $\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$. This is a hypersurface in the complex affine space $N_X(y)_{\mathbb{C}} \simeq \mathbb{C}^c$,
defined as the Zariski closure of $\partial\mathrm{Vor}_X(y)$. For the sake of algebraic consistency, here the
Zariski closure should be computed over the field of definition of $X$ (cf. Example 8.5).


Definition 8.3 The degree of the algebraic boundary of the Voronoi cell at $y \in X$ is denoted

$$\nu_X(y) \,:=\, \deg(\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)).$$

Fig. 8.1: We see the quartic curve $X = \{x^4 - x^2y^2 + y^4 - 4x^2 - 2y^2 - x - 4y + 1 = 0\}$, called butterfly
curve or tooth curve. In each picture, the grey lines are the boundaries of the Voronoi diagram of a finite
sample on or near the curve $X$. From left to right, the number of points in the sample increases. The
Voronoi diagram becomes finer and approaches the Voronoi diagram of the curve. Note how the medial
axis $\mathrm{Med}(X)$ becomes visible here. For the reach $\tau(X)$ see [25, Example 6.1].


The positive integer $\nu_X(y)$ is called the Voronoi degree of $X$ at $y$. If $X$ is irreducible and $y$
is a general point on $X$, then the Voronoi degree $\nu_X(y)$ does not depend on the choice of $y$.
We call that integer the Voronoi degree of the variety $X$.


Example 8.4 (Curves in 3-space) Let $X$ be a general algebraic curve in $\mathbb{R}^3$. For $y \in X$, the
Voronoi cell $\mathrm{Vor}_X(y)$ is a convex set in the normal plane $N_X(y)$. Its algebraic boundary
$\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$ is a plane curve of degree $\nu_X(y)$. This Voronoi degree can be expressed in
terms of the degree and genus of $X$. Specifically, this degree is 12 when $X$ is the intersection
of two general quadrics in $\mathbb{R}^3$. Figure 8.2 shows one such curve $X$ together with the normal
plane at a point $y \in X$. The Voronoi cell $\mathrm{Vor}_X(y)$ is the planar convex region highlighted
on the right. Its algebraic boundary $\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$ is a curve of degree $\nu_X(y) = 12$. The
topological boundary $\partial\mathrm{Vor}_X(y)$ is only a small subset of that algebraic boundary. $\diamond$


Example 8.5 (Surfaces in 3-space) Fix a general polynomial $f \in \mathbb{Q}[x_1,x_2,x_3]$ of degree
$d \geq 2$ and let $X = V(f)$ be its surface in $\mathbb{R}^3$. The normal space at a general point $y \in X$ is
the affine line $N_X(y) = \{y + \lambda \cdot \nabla f(y) \mid \lambda \in \mathbb{R}\}$. The Voronoi cell $\mathrm{Vor}_X(y)$ is a (possibly
unbounded) line segment in $N_X(y)$ that contains $y$. The topological boundary $\partial\mathrm{Vor}_X(y)$
consists of at most two points from among the zeros of an irreducible polynomial in $\mathbb{Q}[\lambda]$.
We shall see that this univariate polynomial has degree $d^3 + d - 7$. Its complex zeros
form the algebraic boundary $\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$. Thus, the Voronoi degree of the surface $X$ is
$d^3 + d - 7$. Note that, in this example, our hypothesis “over the field of definition” becomes
important. Here, the field of definition is $\mathbb{Q}$. The $\mathbb{Q}$-Zariski closure of one boundary point
is the collection of all $d^3 + d - 7$ points in $\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$. $\diamond$


Example 8.6 (Quadrics in 3-space) We illustrate Example 8.5 in the case $y = (0,0,0)$ and
$d = 2$. We consider $f = x_1^2 + x_2^2 + x_3^2 - 3x_1x_2 - 5x_1x_3 - 7x_2x_3 + x_1 + x_2 + x_3$. Let $r_0 \approx -0.209$,
$r_1 \approx -0.107$, $r_2 \approx 0.122$ be the roots of the cubic polynomial $368\lambda^3 + 71\lambda^2 - 6\lambda - 1$. The
Voronoi cell $\mathrm{Vor}_X(y)$ is the line segment connecting the points $(r_1,r_1,r_1)$ and $(r_2,r_2,r_2)$.
The topological boundary $\partial\mathrm{Vor}_X(y)$ consists of these two points, whereas the algebraic
boundary $\partial_{\mathrm{alg}}\mathrm{Vor}_X(y)$ also contains $(r_0,r_0,r_0)$. The cubic polynomial in the unknown $\lambda$
was found with the algebraic method that is described in the next section. Namely, the
Voronoi ideal in (8.4) equals $\mathrm{Vor}_I(0) = \langle u_1 - u_3,\ u_2 - u_3,\ 368u_3^3 + 71u_3^2 - 6u_3 - 1 \rangle$. This
is a maximal ideal in $\mathbb{Q}[u_1,u_2,u_3]$, and it defines a field extension of degree 3 over $\mathbb{Q}$. $\diamond$

Fig. 8.2: A quartic space curve, shown with the Voronoi cell in one of its normal planes.
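The three approximate roots quoted in this example can be reproduced with a few lines of bisection. This is a sketch only: the bracketing intervals were chosen by hand from the signs of the cubic, and the function names are hypothetical.

```python
def voronoi_cubic(lam: float) -> float:
    """The cubic 368*lam^3 + 71*lam^2 - 6*lam - 1 from Example 8.6."""
    return 368 * lam ** 3 + 71 * lam ** 2 - 6 * lam - 1

def bisect(f, lo: float, hi: float, steps: int = 60) -> float:
    """Root of f in [lo, hi], assuming f(lo) and f(hi) have opposite signs."""
    flo = f(lo)
    for _ in range(steps):
        mid = (lo + hi) / 2
        fmid = f(mid)
        if fmid == 0:
            return mid
        if (fmid > 0) == (flo > 0):
            lo, flo = mid, fmid       # the root lies in the upper half
        else:
            hi = mid                  # the root lies in the lower half
    return (lo + hi) / 2

# The cubic changes sign on each of these three intervals.
r0 = bisect(voronoi_cubic, -0.3, -0.15)
r1 = bisect(voronoi_cubic, -0.15, 0.0)
r2 = bisect(voronoi_cubic, 0.0, 0.3)
```

The roots $r_1$ and $r_2$ on either side of $\lambda = 0$ are the two endpoints of the Voronoi cell of $y = (0,0,0)$, while $r_0$ belongs only to the algebraic boundary.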


8.2 Algebraic Boundaries 


For any point in the ambient space, the ED problem asks the question “What point on the 
variety X am I closest to?” Another question one might ask is “How far do we have to 
get away from X before there is more than one answer to the closest point question?” The 
union of the boundaries of the Voronoi cells is the locus of points in $\mathbb{R}^n$ that have more
than one closest point on X. This is the medial axis Med(X). 

The distance from the variety to its medial axis, which is the answer to the “how far” 
question, is the reach of X. We have proved this in Proposition 7.2. The distance from a 
point y on X to the variety’s medial axis is called the local reach of X. We formally define 
the local reach in (15.1), where we use it in a theorem that tells us how to compute the 
homology of X from finite samples. This is relevant for topological data analysis. 

The material that follows is based on the article [50] by Cifuentes, Ranestad, Sturmfels,
and Weinstein. We begin with the exact symbolic computation of the Voronoi boundary
at $y$ from the equations that define $X$. This uses a Gröbner-based algorithm whose input is $y$
and the ideal of $X$ and whose output is the ideal defining $\partial_{\mathrm{alg}} \mathrm{Vor}_X(y)$. In the next section,
we present formulas for the Voronoi degree $\nu_X(y)$ when $X$ and $y$ are sufficiently general
and $\dim(X) \leq 2$. The proofs of these formulas require some intersection theory. Thereafter,
in Section 8.4, we study the case when $y$ is a low-rank matrix and $X$ is the variety of all
matrices of bounded rank. This relies on the Eckart–Young Theorem (Theorem 2.9).
We now describe Gröbner basis methods for finding the Voronoi boundaries of a given
variety. We start with an ideal $I = \langle f_1, f_2, \ldots, f_m \rangle$ in $\mathbb{Q}[x_1, \ldots, x_n]$ whose real variety
$X = V(I) \subset \mathbb{R}^n$ is assumed to be non-empty. We assume that $I$ is real radical and prime,
so that the zero set of $I$ in $\mathbb{C}^n$ is an irreducible variety whose real points are Zariski dense.
Our aim is to compute the Voronoi boundary of a given point $y \in X$. In our examples,
the coordinates of $y$ and the coefficients of the polynomials $f_i$ are rational numbers. Under
these assumptions, the following computations can be done in polynomial rings over $\mathbb{Q}$.

Fix the polynomial ring $R = \mathbb{Q}[x_1, \ldots, x_n, u_1, \ldots, u_n]$ where $u = (u_1, \ldots, u_n)$ is an
auxiliary point with unknown coordinates. As in Chapter 2, the augmented Jacobian
$\mathcal{AJ}(x,u)$ of $X$ at $x$ is the $(m + 1) \times n$ matrix with entries in $R$ that is obtained by adding
the row vector $u - x$ to the Jacobian matrix $(\partial f_i / \partial x_j)$.

Let $N_I$ denote the ideal in $R$ generated by $I$ and the $(c + 1) \times (c + 1)$ minors of the
augmented Jacobian matrix $\mathcal{AJ}(x,u)$, where $c$ is the codimension of the variety $X \subset \mathbb{R}^n$.
If $X$ is smooth, then the ideal $N_I$ in $R$ defines a subvariety of dimension $n$ in $\mathbb{R}^{2n}$, namely
the Euclidean normal bundle of $X$. Its points are pairs $(x, u)$ where $x$ is a point in the given
variety $X$ and $u$ lies in the normal space of $X$ at $x$.

In Chapter 2, we have worked with the critical ideal. The ideal $N_I$ is similar to the
critical ideal, but the key difference is that now $u$ is a vector of variables. In both cases,
when $X$ is singular, we may wish to saturate with respect to the ideal of the singular locus
of $X$. In what follows, for any $y \in X$, let $N_I(y)$ denote the linear ideal of the normal space.
This is obtained from $N_I$ by replacing the unknown point $x$ by the specific point $y$.


Example 8.7 (Cuspidal cubic) Let $n = 2$ and $I = \langle x_1^3 - x_2^2 \rangle$, so $X = V(I) \subset \mathbb{R}^2$ is a cubic
curve with a cusp at the origin. The ideal of the Euclidean normal bundle of $X$ is generated
by two polynomials:

$$ N_I = \Big\langle\, x_1^3 - x_2^2,\ \det \begin{pmatrix} u_1 - x_1 & u_2 - x_2 \\ 3x_1^2 & -2x_2 \end{pmatrix} \Big\rangle \subset R = \mathbb{Q}[x_1, x_2, u_1, u_2]. \qquad (8.3) $$

Consider the normal line of $X$ at $y = (4, 8)$. Its ideal is $N_I(y) = \langle u_1 + 3u_2 - 28 \rangle$. ⋄


Returning to the general setting, we define for a fixed point $y$ the following ideal:

$$ C_I(y) = N_I + N_I(y) + \langle\, \|x - u\|^2 - \|y - u\|^2 \,\rangle \subset R. $$

The real variety of $C_I(y)$ lives in $\mathbb{R}^{2n}$. It consists of all pairs $(u, x)$ such that $x$ and $y$ are
equidistant from $u$ and both are critical points of the distance function from $u$ to $X$.


Definition 8.8 The Voronoi ideal in $\mathbb{Q}[u_1, \ldots, u_n]$ is obtained from the ideal $C_I(y)$ defined
above by saturation and elimination, as follows:

$$ \mathrm{Vor}_I(y) = \big( C_I(y) : \langle x - y \rangle^\infty \big) \cap \mathbb{Q}[u_1, \ldots, u_n]. \qquad (8.4) $$

The geometric interpretation of each step in our construction implies the following result:


Proposition 8.9 The affine variety in $\mathbb{C}^n$ defined by the Voronoi ideal $\mathrm{Vor}_I(y)$ contains
the algebraic Voronoi boundary $\partial_{\mathrm{alg}} \mathrm{Vor}_X(y)$ of the given real variety $X$ at its point $y$.


Remark 8.10 The verb “contains” sounds weak, but it is much stronger than it may seem.
Indeed, in generic situations, the ideal $\mathrm{Vor}_I(y)$ is prime and defines an irreducible hypersurface
in the normal space of $X$ at $y$. This hypersurface equals the algebraic Voronoi boundary,
so containment is an equality. We saw this in Examples 8.5 and 8.6. The ideal $\mathrm{Vor}_I(y)$
usually defines a hypersurface in the normal space. For special data, it can have extraneous
components, but these are easy to identify and remove when the dimension is low.


Example 8.11 We consider the cuspidal cubic $X = V(I) \subset \mathbb{R}^2$ in Example 8.7, where
$I = \langle x_1^3 - x_2^2 \rangle$, and we fix the point $y = (4, 8) \in X$. Going through the steps above, we find
that the Voronoi ideal equals

$$ \mathrm{Vor}_I(y) = \langle u_1 - 28,\ u_2 \rangle \,\cap\, \langle u_1 + 26,\ u_2 - 18 \rangle \,\cap\, \langle u_1 + 3u_2 - 28,\ 27u_2^2 - 486u_2 + 2197 \rangle. \qquad (8.5) $$

The third component has no real roots and is hence extraneous. The Voronoi boundary
consists of two points. Namely, we have $\partial \mathrm{Vor}_X(y) = \{(28, 0), (-26, 18)\}$. The Voronoi
cell $\mathrm{Vor}_X(y)$ is the line segment connecting these points. This segment is shown in green
in Figure 8.3. Its right endpoint $(28, 0)$ is equidistant from $y$ and the point $(4, -8)$. Its left
endpoint $(-26, 18)$ is equidistant from $y$ and the origin $(0, 0)$, which is the singular point
of the curve $X$. Its Voronoi cell will be discussed in Remark 8.12.

The issue of saturation is subtle and interesting in this example. In (8.3), we did not
saturate the ideal $N_I$, and this led to the three components in (8.5). By contrast, suppose we
saturate by the singular point of our curve, i.e. we replace $N_I$ by $N_I : \langle x_1, x_2 \rangle^\infty$. Then (8.4)
yields a strictly larger ideal than in (8.5). The second prime ideal is gone, and $\mathrm{Vor}_I(y)$ is only
the intersection of the first and third prime ideals. Geometrically, we are losing the point
$(-26, 18)$. This makes sense because that point has the same distance to $(4, 8)$ and to $(0, 0)$.
The saturation step has removed the singular point $(0, 0)$ from our algebraic representation.
The resulting Voronoi ideal only sees pairs of smooth points that are equidistant.


Fig. 8.3: The cuspidal cubic is shown in red. The Voronoi cell of a smooth point is a green line segment. 
The Voronoi cell of the cusp is the blue convex region bounded by the blue curve. 

The cuspidal cubic $X$ is very special. If we replace $X$ by a general cubic (defined over $\mathbb{Q}$)
in $\mathbb{R}^2$, then $\mathrm{Vor}_I(y)$ is generated modulo $N_I(y)$ by an irreducible polynomial of degree eight
in $\mathbb{Q}[u_1, u_2]$. Thus, the expected Voronoi degree for general plane cubics is $\nu_X(y) = 8$. ⋄
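
The claims about the cuspidal cubic in Example 8.11 can be verified with exact integer arithmetic. The following sketch checks that each boundary point lies on the normal line $u_1 + 3u_2 = 28$, that it is equidistant from $y = (4, 8)$ and a second critical point on the curve, and that the quadric in the third component of (8.5) has negative discriminant. The function names are illustrative and not part of the source.

```python
# Data from Examples 8.7 and 8.11: the curve x1^3 = x2^2 and the point y = (4, 8).
Y = (4, 8)

def on_curve(x):
    """True if x lies on the cuspidal cubic."""
    return x[0] ** 3 - x[1] ** 2 == 0

def squared_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def normal_bundle_minor(x, u):
    """The 2x2 determinant from (8.3); it vanishes when u - x is normal to the curve at x."""
    return (u[0] - x[0]) * (-2 * x[1]) - (u[1] - x[1]) * (3 * x[0] ** 2)

def is_boundary_witness(u, x):
    """True if u is on the normal line at Y and x is a second critical point at equal distance."""
    return (u[0] + 3 * u[1] - 28 == 0
            and on_curve(x)
            and normal_bundle_minor(Y, u) == 0
            and normal_bundle_minor(x, u) == 0
            and squared_distance(u, x) == squared_distance(u, Y))

right_endpoint = is_boundary_witness((28, 0), (4, -8))
left_endpoint = is_boundary_witness((-26, 18), (0, 0))

# Discriminant of 27*u2^2 - 486*u2 + 2197 from the third component of (8.5).
discriminant = 486 ** 2 - 4 * 27 * 2197

print(right_endpoint, left_endpoint, discriminant)  # prints: True True -1080
```

The negative discriminant confirms that the third component contributes no real points. At the cusp the gradient vanishes, so the minor from (8.3) is zero there for every $u$, which is why the unsaturated ideal retains the point $(-26, 18)$.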


Remark 8.12 (Singularities) Voronoi cells at singular points can be computed with the
same tools as above. However, these Voronoi cells can have higher dimensions. For an
illustration, consider the cuspidal cubic, and let $y = (0, 0)$ be the cusp. A Gröbner basis
computation yields the Voronoi boundary $27u_2^4 + 128u_1^3 + 72u_1u_2^2 + 32u_1^2 + u_2^2 + 2u_1$. The
Voronoi cell is the two-dimensional convex region bounded by this quartic, shown in blue
in Figure 8.3. The Voronoi cell can also be empty at a singularity. This happens for instance
for $V(x_1^3 + x_1^2 - x_2^2)$, which has an ordinary double point at $y = (0, 0)$. In general, the cell
dimension depends on both the embedding dimension and the branches of the singularity.
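
The quartic in Remark 8.12 can be checked against a rational parametrization. The parametrization below is our own derivation, not taken from the source: a point $u$ that is equidistant from the cusp and from $x = (t^2, t^3)$, with $x$ critical for the distance from $u$, works out to $u = \big(-\tfrac{1}{2}(t^2 + 3t^4),\ t + 2t^3\big)$. The sketch verifies with exact rational arithmetic that these points satisfy the quartic and the equidistance condition.

```python
from fractions import Fraction

def boundary_point(t):
    """Assumed parametrization of the Voronoi boundary of the cusp (derived, not from the source)."""
    t = Fraction(t)
    return (-(t ** 2 + 3 * t ** 4) / 2, t + 2 * t ** 3)

def quartic(u1, u2):
    """The quartic from Remark 8.12."""
    return (27 * u2 ** 4 + 128 * u1 ** 3 + 72 * u1 * u2 ** 2
            + 32 * u1 ** 2 + u2 ** 2 + 2 * u1)

def equidistant_from_cusp(t):
    """True if boundary_point(t) has equal distance to (0, 0) and to (t^2, t^3)."""
    t = Fraction(t)
    u1, u2 = boundary_point(t)
    x1, x2 = t ** 2, t ** 3
    return u1 ** 2 + u2 ** 2 == (u1 - x1) ** 2 + (u2 - x2) ** 2

parameters = [Fraction(k, 7) for k in range(-14, 15)]
quartic_values = [quartic(*boundary_point(t)) for t in parameters]
equidistance = [equidistant_from_cusp(t) for t in parameters]

print(all(v == 0 for v in quartic_values), all(equidistance))
```

Since a nonzero polynomial of degree 12 in $t$ cannot vanish at 29 distinct parameter values, the exact zeros show that the whole parametrized curve lies on the quartic.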


Proposition 8.9 gives an algorithm for computing the Voronoi ideal $\mathrm{Vor}_I(y)$ when $y$ is
a smooth point in $X = V(I)$. Experiments with Macaulay2 [73] are reported in [50]. For
small instances, the computation terminates and we obtain the defining polynomial of the
Voronoi boundary $\partial_{\mathrm{alg}} \mathrm{Vor}_X(y)$. This polynomial is unique modulo the linear ideal of the
normal space $N_I(y)$. For larger instances, we can only compute the degree of $\partial_{\mathrm{alg}} \mathrm{Vor}_X(y)$
but not its equation. This is done by working over a finite field and adding $c - 1$ random
linear equations in $u_1, \ldots, u_n$ in order to get a zero-dimensional polynomial system.

Computations are easiest to set up for the case of hypersurfaces ($c = 1$). One can
explore random polynomials $f$ of degree $d$ in $\mathbb{Q}[x_1, \ldots, x_n]$, both inhomogeneous and
homogeneous. These are chosen among those that vanish at a preselected point $y$ in $\mathbb{Q}^n$.
In each iteration, the Voronoi ideal $\mathrm{Vor}_I(y)$ from (8.4) was found to be zero-dimensional.
In fact, $\mathrm{Vor}_I(y)$ is a maximal ideal in $\mathbb{Q}[u_1, \ldots, u_n]$, and the Voronoi degree $\nu_X(y)$ is the
degree of the field extension of $\mathbb{Q}$ that is defined by that maximal ideal.

We summarize our results in Tables 8.1 and 8.2, and we extract conjectural formulas. 


 n\d |  2    3     4     5     6    7    8  | $\nu_X(y) = \mathrm{degree}(\partial_{\mathrm{alg}} \mathrm{Vor}_I(y))$
  1  |  1    2     3     4     5    6    7  | $d - 1$
  2  |  2    8    16    26    38   52   68  | $d^2 + d - 4$
  3  |  3   23    61   123   215  343       | $d^3 + d - 7$
  4  |  4   56   202   520  1112            | $d^4 - d^3 + d^2 + d - 10$
  5  |  5  125   631                        | $d^5 - 2d^4 + 2d^3 + d - 13$
  6  |  6  266  1924                        | $d^6 - 3d^5 + 4d^4 - 2d^3 + d^2 + d - 16$
  7  |  7  551                              | $d^7 - 4d^6 + 7d^5 - 6d^4 + 3d^3 + d - 19$


Table 8.1: The Voronoi degree of an inhomogeneous polynomial $f$ of degree $d$ in $\mathbb{R}^n$.


Conjecture 8.13 The Voronoi degree of a generic hypersurface of degree $d$ in $\mathbb{R}^n$ equals

$$ (d-1)^n + 3(d-1)^{n-1} + \frac{4}{d-2}\big((d-1)^{n-1} - 1\big) - 3n. $$

Conjecture 8.14 The Voronoi degree of a generic homogeneous polynomial of degree $d$ is

$$ 2(d-1)^{n-1} + \frac{4}{d-2}\big((d-1)^{n-1} - 1\big) - 3n + 2. $$


 n\d |  2    3    4    5    6    7    8  | $\nu_X(y) = \mathrm{degree}(\partial_{\mathrm{alg}} \mathrm{Vor}_I(y))$
  2  |  2    4    6    8   10   12   14  | $2d - 2$
  3  |  3   13   27   45   67   93  123  | $2d^2 - 5$
  4  |  4   34   96  202                 | $2d^3 - 2d^2 + 2d - 8$
  5  |  5   79  309                      | $2d^4 - 4d^3 + 4d^2 - 11$
  6  |  6  172                           | $2d^5 - 6d^4 + 8d^3 - 4d^2 + 2d - 14$
  7  |  7  361                           | $2d^6 - 8d^5 + 14d^4 - 12d^3 + 6d^2 - 17$


Table 8.2: The Voronoi degree of a homogeneous polynomial $f$ of degree $d$ in $\mathbb{R}^n$.


Both conjectures are proved for $n \leq 3$ in [50, Section 4], where the geometric theory of
Voronoi degrees of low-dimensional varieties is developed. The case $d = 2$ was analyzed
in [49, Proposition 5.8]. For $n \geq 4$ and $d \geq 3$, the conjectures remain open.
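
The closed formulas in Conjectures 8.13 and 8.14 can be compared with every entry of Tables 8.1 and 8.2. In the sketch below, the fraction $\big((d-1)^{n-1} - 1\big)/(d-2)$ is written as the geometric sum $1 + (d-1) + \cdots + (d-1)^{n-2}$; reading the case $d = 2$ through this sum is our interpretation, and the function names are illustrative.

```python
def geometric_sum(n, d):
    """((d-1)^(n-1) - 1) / (d-2), written as a finite sum so that d = 2 is allowed."""
    return sum((d - 1) ** k for k in range(n - 1))

def degree_inhomogeneous(n, d):
    """Formula of Conjecture 8.13."""
    return (d - 1) ** n + 3 * (d - 1) ** (n - 1) + 4 * geometric_sum(n, d) - 3 * n

def degree_homogeneous(n, d):
    """Formula of Conjecture 8.14."""
    return 2 * (d - 1) ** (n - 1) + 4 * geometric_sum(n, d) - 3 * n + 2

# Rows of Tables 8.1 and 8.2, indexed by n; the columns start at d = 2.
TABLE_8_1 = {
    1: [1, 2, 3, 4, 5, 6, 7],
    2: [2, 8, 16, 26, 38, 52, 68],
    3: [3, 23, 61, 123, 215, 343],
    4: [4, 56, 202, 520, 1112],
    5: [5, 125, 631],
    6: [6, 266, 1924],
    7: [7, 551],
}
TABLE_8_2 = {
    2: [2, 4, 6, 8, 10, 12, 14],
    3: [3, 13, 27, 45, 67, 93, 123],
    4: [4, 34, 96, 202],
    5: [5, 79, 309],
    6: [6, 172],
    7: [7, 361],
}

def mismatches(table, formula):
    """List the (n, d) pairs at which the formula disagrees with the table."""
    return [(n, d)
            for n, row in table.items()
            for d, value in enumerate(row, start=2)
            if formula(n, d) != value]

print(mismatches(TABLE_8_1, degree_inhomogeneous))  # prints: []
print(mismatches(TABLE_8_2, degree_homogeneous))    # prints: []
```

Both lists are empty, so the two formulas reproduce all tabulated values, including the column $d = 2$.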


8.3 Degree Formulas 


To recap, the algebraic boundary of the Voronoi cell $\mathrm{Vor}_X(y)$ is a hypersurface in the
normal space to a variety $X \subset \mathbb{R}^n$ at a point $y \in X$. If $y$ has rational coordinates, then
that hypersurface is defined over $\mathbb{Q}$. We shall present formulas for the degree $\nu_X(y)$ of that
hypersurface when $X$ is a curve or a surface. All proofs appear in [50, Section 6]. We identify
$X$ and $\partial_{\mathrm{alg}} \mathrm{Vor}_X(y)$ with their Zariski closures in complex projective space $\mathbb{P}^n$, so there is a
natural assigned hyperplane at infinity. We say that the variety $X$ is in general position in $\mathbb{P}^n$
if the hyperplane at infinity intersects $X$ transversally. The next result is [50, Theorem 5.1].


Theorem 8.15 Let $X \subset \mathbb{P}^n$ be a curve of degree $d$ and geometric genus $g$ with at most
ordinary multiple points as singularities. If $X$ is in general position, then the Voronoi
degree at a general point $y \in X$ equals

$$ \nu_X(y) = 4d + 2g - 6. $$


Example 8.16 If $X$ is a smooth curve of degree $d$ in the plane, then $2g - 2 = d(d - 3)$, so

$$ \nu_X(y) = d^2 + d - 4. $$

This confirms our experimental results in the row $n = 2$ of Table 8.1. ⋄


Example 8.17 If $X$ is a rational curve of degree $d$, then $g = 0$ and our formula gives
$\nu_X(y) = 4d - 6$. If $X$ is an elliptic curve ($g = 1$), then $\nu_X(y) = 4d - 4$. A space curve with
$d = 4$ and $g = 1$ was studied in Example 8.4. Its Voronoi degree equals $\nu_X(y) = 12$. ⋄


The general position assumption in Theorem 8.15 is essential. For an example, let $X$ be
the twisted cubic curve in $\mathbb{P}^3$, with affine parameterization $t \mapsto (t, t^2, t^3)$. Here $g = 0$ and
$d = 3$, so the expected Voronoi degree is 6. However, a computation shows that $\nu_X(y) = 4$.
This drop arises because the plane at infinity in $\mathbb{P}^3$ intersects the curve $X$ in a triple point.
After a general linear change of coordinates in $\mathbb{P}^3$, which amounts to a linear fractional
transformation in $\mathbb{R}^3$, we correctly find $\nu_X(y) = 6$.

We next present a formula for the Voronoi degree of a surface $X$, which is smooth and
irreducible in $\mathbb{P}^n$. Our formula is in terms of its degree $d$ and two further invariants. The first
is the topological Euler characteristic $\chi(X)$. This equals the degree of the second Chern
class of the tangent bundle; see Example 4.22 (b). The second invariant, denoted $g(X)$, is
the genus of the curve obtained by intersecting $X$ with a general quadratic hypersurface
in $\mathbb{P}^n$. Thus, $g(X)$ is the quadratic analogue to the sectional genus of the surface $X$.


Theorem 8.18 (Theorem 5.4 in [50]) Let $X \subset \mathbb{P}^n$ be a smooth surface of degree $d$. Then

$$ \nu_X(y) = 3d + \chi(X) + 4g(X) - 11, $$

provided the surface $X$ is in general position in $\mathbb{P}^n$ and $y$ is a general point on $X$.


Example 8.19 If $X$ is a smooth surface in $\mathbb{P}^3$ of degree $d$, then (4.10) and Example 2.21
yield $\chi(X) = \deg(c_2(X)) = d(d^2 - 4d + 6)$. A smooth quadratic hypersurface section of $X$
is an irreducible curve of degree $(d, d)$ in $\mathbb{P}^1 \times \mathbb{P}^1$ with genus $g(X) = (d - 1)^2$. Thus,

$$ \nu_X(y) = 3d + d(d^2 - 4d + 6) + 4(d - 1)^2 - 11 = d^3 + d - 7. $$

This confirms our experimental results in the row $n = 3$ of Table 8.1. ⋄


Example 8.20 Let $X$ be the Veronese surface of order $e$ in $\mathbb{P}^{\binom{e+2}{2} - 1}$ (see Definition 12.5),
taken after a general linear change of coordinates in that ambient space. The degree of
$X$ equals $d = e^2$ (this is proved in Corollary 12.21). We have $\chi(X) = \chi(\mathbb{P}^2) = 3$, and
the general quadratic hypersurface section of $X$ is a curve of genus $g(X) = \binom{2e-1}{2}$. We
conclude that the Voronoi degree of $X$ at a general point $y$ equals

$$ \nu_X(y) = 3e^2 + 3 + 2(2e - 1)(2e - 2) - 11 = 11e^2 - 12e - 4. $$

For instance, for the Veronese surface in $\mathbb{P}^5$ we have $e = 2$ and hence $\nu_X(y) = 16$. This is
smaller than the number 18 to be found later in Example 8.25. That example concerns the
cone over the Veronese surface in $\mathbb{R}^6$, and not the Veronese surface in $\mathbb{R}^5 \subset \mathbb{P}^5$. ⋄


We finally consider affine surfaces defined by homogeneous polynomials. Namely, let
$X \subset \mathbb{R}^n$ be the affine cone over a general smooth curve of degree $d$ and genus $g$ in $\mathbb{P}^{n-1}$.


Theorem 8.21 (Theorem 5.7 in [50]) If $X$ is the cone over a smooth curve in $\mathbb{P}^{n-1}$ then

$$ \nu_X(y) = 6d + 4g - 9, $$

provided that the curve is in general position and $y$ is a general point.


Example 8.22 If $X \subset \mathbb{R}^3$ is the cone over a smooth curve of degree $d$ in $\mathbb{P}^2$, then we have
$2g - 2 = d(d - 3)$, by the degree-genus formula for plane curves. We conclude that

$$ \nu_X(y) = 2d^2 - 5. $$

This confirms our experimental results on Voronoi degrees in the row $n = 3$ of Table 8.2. ⋄
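
The specializations in Examples 8.16, 8.17, 8.19, 8.20 and 8.22 are elementary identities, and they can be confirmed over a range of degrees. The sketch below encodes the three degree formulas of Theorems 8.15, 8.18 and 8.21 as stated above; the function names and the tested range of degrees are our own choices.

```python
def curve_degree(d, g):
    """Voronoi degree of a curve (Theorem 8.15)."""
    return 4 * d + 2 * g - 6

def surface_degree(d, chi, g):
    """Voronoi degree of a smooth surface (Theorem 8.18)."""
    return 3 * d + chi + 4 * g - 11

def cone_degree(d, g):
    """Voronoi degree of the cone over a smooth curve (Theorem 8.21)."""
    return 6 * d + 4 * g - 9

def plane_curve_genus(d):
    """Genus of a smooth plane curve, from 2g - 2 = d(d - 3)."""
    return (d - 1) * (d - 2) // 2

degrees = range(2, 30)

# Example 8.16: smooth plane curves.
plane_curves_ok = all(
    curve_degree(d, plane_curve_genus(d)) == d ** 2 + d - 4 for d in degrees)

# Example 8.19: smooth surfaces in P^3.
surfaces_ok = all(
    surface_degree(d, d * (d ** 2 - 4 * d + 6), (d - 1) ** 2) == d ** 3 + d - 7
    for d in degrees)

# Example 8.20: Veronese surfaces of order e, with g(X) = binomial(2e - 1, 2).
veronese_ok = all(
    surface_degree(e ** 2, 3, (2 * e - 1) * (2 * e - 2) // 2) == 11 * e ** 2 - 12 * e - 4
    for e in degrees)

# Example 8.22: cones over smooth plane curves.
cones_ok = all(
    cone_degree(d, plane_curve_genus(d)) == 2 * d ** 2 - 5 for d in degrees)

print(plane_curves_ok, surfaces_ok, veronese_ok, cones_ok)
print(curve_degree(4, 1), curve_degree(3, 0))  # prints: 12 6
```

The last line recovers the value 12 for the elliptic quartic of Example 8.17 and the expected value 6 for the twisted cubic in general position.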


Let us comment on the assumption made in our theorems, namely that $X$ is in general
position in $\mathbb{P}^n$. If this is not satisfied, then the Voronoi degree may drop, assuming the
Voronoi ideal remains zero-dimensional. Indeed, the Voronoi ideal $\mathrm{Vor}_I(y)$ depends polynomially on the
description of $X$, and the degree of this ideal can only go down, and not up, when that
description specializes. This follows from the Parameter Continuation Theorem 3.18.


8.4 Voronoi meets Eckart-Young 


We now turn to a case of great interest in applications. Let $X$ be the variety of real $m \times n$
matrices of rank at most $r$. We consider two norms on the space $\mathbb{R}^{m \times n}$ of real $m \times n$ matrices.
Our first matrix norm is the spectral norm or operator norm $\|U\|_{\mathrm{op}} := \max_i \sigma_i(U)$ which
extracts the largest singular value of the matrix $U$. Our second norm is the Frobenius norm
$\|U\|_F := \sqrt{\sum_{i,j} U_{ij}^2} = \sqrt{\mathrm{Trace}(U^T U)}$. The Frobenius norm agrees with the Euclidean norm
on $\mathbb{R}^{m \times n}$, so it fits into our setting. The case of symmetric matrices will be discussed later.

Fix a rank $r$ matrix $V$ in $X$. This is a nonsingular point in $X$. Consider any matrix $U$ in the
Voronoi cell $\mathrm{Vor}_X(V)$. This means that the closest point to $U$ in the rank $r$ variety $X$ relative
to the Frobenius norm is the matrix $V$. By the Eckart–Young Theorem (Theorem 2.9), the
matrix $V$ is derived from $U$ by computing the singular value decomposition $U = \Sigma_1 D \Sigma_2$.
Here $\Sigma_1$ and $\Sigma_2$ are orthogonal matrices of size $m \times m$ and $n \times n$, respectively, and $D$ is a
nonnegative diagonal matrix whose entries are the singular values. Let $D^{[r]}$ be the matrix
that is obtained from $D$ by replacing all singular values except for the $r$ largest ones by
zero. Then, according to Eckart–Young, we have $V = \Sigma_1 \cdot D^{[r]} \cdot \Sigma_2$.


Remark 8.23 The Eckart–Young Theorem works for both the Frobenius norm and the
spectral norm. This means that $\mathrm{Vor}_X(V)$ is also the Voronoi cell for the spectral norm.


The following theorem describes the Voronoi cells for low-rank matrix approximation.


Theorem 8.24 Let $V$ be an $m \times n$-matrix of rank $r$. Let $u$ be the $r$-th singular value of $V$.
The Voronoi cell $\mathrm{Vor}_X(V)$ is the ball of radius $u$ in the spectral norm on the space of
$(m - r) \times (n - r)$-matrices.


Proof Since the Frobenius norm is orthogonally invariant, we can assume that the matrix
$V = (v_{ij}) \in X$ is a diagonal matrix whose entries are $v_{11} \geq v_{22} \geq \cdots \geq v_{rr} = u > 0$. These
entries are the singular values of $V$. The normal space of the determinantal variety $X$ is
described in Lemma 9.12. In particular, the normal space of $X$ at $V$ consists of the matrices

$$ A = \begin{pmatrix} 0 & 0 \\ 0 & U \end{pmatrix}, $$

where $U$ is an arbitrary $(m - r) \times (n - r)$ matrix. The matrix $V$ is the rank $r$ matrix closest
to $V + A$ if and only if the spectral norm of $U$ is less than $u$. This shows that the Voronoi
cell $\mathrm{Vor}_X(V)$ is a full-dimensional convex body in the normal space, which we identified
with the matrices $U$. Namely, $\mathrm{Vor}_X(V)$ equals $u$ times the unit ball in $\mathbb{R}^{(m-r) \times (n-r)}$ under
the spectral norm. □


Fig. 8.4: The Voronoi cell of a symmetric $3 \times 3$ matrix of rank 1 is a convex body of dimension 3. It is
shown for the Frobenius norm (left) and for the Euclidean norm (right). See Example 8.25 for a discussion.


The proof of Theorem 8.24 offers the following perspective on the Voronoi cells $\mathrm{Vor}_X(V)$
of the determinantal variety $X$. Suppose $m = n$. Consider the set $Z$ of vectors in $\mathbb{R}^n$ that
have at most $r$ non-zero coordinates. This is a reducible variety with $\binom{n}{r}$ components, each
a coordinate subspace. For a general point $y$ in such a subspace, the Voronoi cell $\mathrm{Vor}_Z(y)$
is a convex polytope. It is congruent to a regular cube of dimension $n - r$, which is the unit
ball in the $L^\infty$-norm on $\mathbb{R}^{n-r}$. By Theorem 8.24, the Voronoi cell $\mathrm{Vor}_X(V)$ is the orbit of
that cube under the group of orthogonal transformations. To make sense of this action, we
here identify $\mathbb{R}^{n-r}$ with the space of diagonal matrices in $\mathbb{R}^{(n-r) \times (n-r)}$.

For example, consider the special case where $n = 3$ and $r = 1$. In this case, $Z$ consists
of the three coordinate axes in $\mathbb{R}^3$. The Voronoi decomposition of this reducible curve
decomposes $\mathbb{R}^3$ into squares, each normal to a different point on the three lines. The image
of this picture under orthogonal transformations is the Voronoi decomposition of $\mathbb{R}^{3 \times 3}$
associated with the affine variety of rank 1 matrices. That variety has dimension 5, and
each Voronoi cell is a 4-dimensional convex body in the normal space.
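
The picture for the coordinate axes can be tested directly. The closest point to $u$ among vectors with at most $r$ non-zero coordinates is obtained by keeping the $r$ entries of largest absolute value, so membership in a Voronoi cell reduces to a comparison of absolute values. The sketch below uses the hypothetical sample point $y = (2, 0, 0)$ with $r = 1$ and confirms on a grid that its cell is the square of half-side 2 in the normal plane; points on the boundary of the square, where ties occur, are skipped.

```python
def project_to_sparse(u, r):
    """Closest point to u with at most r non-zero coordinates: keep the r largest |u_i|."""
    keep = sorted(range(len(u)), key=lambda i: abs(u[i]), reverse=True)[:r]
    return tuple(u[i] if i in keep else 0 for i in range(len(u)))

def in_voronoi_cell(y, w, r):
    """True if y is the closest sparse point to y + w."""
    u = tuple(a + b for a, b in zip(y, w))
    return project_to_sparse(u, r) == y

y = (2, 0, 0)   # a general point on the first coordinate axis (illustrative choice)
r = 1

inside = in_voronoi_cell(y, (0, 1.5, -1.9), r)
outside = in_voronoi_cell(y, (0, 2.5, 0.0), r)

# Compare cell membership with the sup-norm condition max(|a|, |b|) < 2 on a grid.
grid = [k / 2 for k in range(-6, 7)]
square_ok = all(
    in_voronoi_cell(y, (0, a, b), r) == (max(abs(a), abs(b)) < 2)
    for a in grid for b in grid
    if max(abs(a), abs(b)) != 2)

print(inside, outside, square_ok)  # prints: True False True
```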

If $u$ is the smallest singular value of $V$, then $\lambda \cdot u$ is the smallest singular value of
$\lambda \cdot V$, for every $\lambda > 0$. If we wish to compare the sizes of Voronoi cells, then we should
work with some normalization and consider the relative size of Voronoi cells. Let us
normalize rank-$r$ matrices such that their largest singular value is equal to one. That is, we
intersect the variety $X$ with the unit sphere in the spectral norm. Let $V$ be a matrix in this
intersection. Then, by Theorem 8.24 the size of its Voronoi cell is $u = \sigma_r(V)/\sigma_1(V)$. In
Chapter 9 we will study this ratio of singular values. In particular, we will show in (9.4)
that $\sigma_r(V)/\sigma_1(V)$ is the inverse of the Turing condition number of $V$, which is related to
matrix inversion. Therefore, the relative size of the Voronoi cell of $V$ is the inverse of the
relative condition number of $V$. There is a close connection between condition numbers
and errors in numerical computation. We explain this in Chapter 9.

Our problem is even more interesting when we restrict to matrices in a linear subspace.
To see this, let now $X$ denote the variety of symmetric $n \times n$ matrices of rank at most $r$.
We can regard $X$ either as a variety in the matrix space $\mathbb{R}^{n \times n}$, or in the space $\mathbb{R}^{\binom{n+1}{2}}$ whose
coordinates are the upper triangular entries. On the latter space we have the Euclidean
norm and the Frobenius norm (aka Bombieri–Weyl norm). These are now different!

The Frobenius norm on $\mathbb{R}^{\binom{n+1}{2}}$ is the restriction of the Frobenius norm on $\mathbb{R}^{n \times n}$ to the
subspace of symmetric matrices. For instance, if $n = 2$, we identify the vector $(a, b, c)$
with the symmetric matrix $\begin{pmatrix} a & b \\ b & c \end{pmatrix}$. The Frobenius norm of this matrix is $\sqrt{a^2 + 2b^2 + c^2}$
(cf. Example 9.15). On the other hand, the Euclidean norm is $\sqrt{a^2 + b^2 + c^2}$. The two norms
have dramatically different properties with respect to low-rank approximation. The Eckart–Young
Theorem remains valid for the Frobenius norm on $\mathbb{R}^{\binom{n+1}{2}}$, but it is not valid for the
Euclidean norm. Some implications are explained in [60, Example 3.2]. In what follows,
we elucidate this point by comparing the Voronoi cells with respect to the two norms.


Example 8.25 Let $X$ be the variety of symmetric $3 \times 3$ matrices of rank $\leq 1$. For the
Euclidean metric, $X$ lives in $\mathbb{R}^6$. For the Frobenius metric, $X$ lives in a 6-dimensional
subspace of $\mathbb{R}^{3 \times 3}$. Let $V$ be a smooth point in $X$, i.e. a symmetric $3 \times 3$ matrix of rank 1.
The normal space to $X$ at $V$ has dimension 3. Hence, in either norm, the Voronoi cell
$\mathrm{Vor}_X(V)$ is a 3-dimensional convex body. Figure 8.4 shows these two bodies.

For the Frobenius metric, each Voronoi cell of the Veronese variety $X$ is congruent to
the set of matrices $\begin{pmatrix} a & b \\ b & c \end{pmatrix}$ whose eigenvalues are between $-1$ and $1$. This semialgebraic
set is bounded by the singular quadratic surfaces with defining polynomials $\det \begin{pmatrix} a+1 & b \\ b & c+1 \end{pmatrix}$
and $\det \begin{pmatrix} a-1 & b \\ b & c-1 \end{pmatrix}$. The Voronoi ideal is of degree 4, and it is generated by the product of
these two determinants (modulo the normal space). The Voronoi cell is shown on the left
in Figure 8.4. It is the intersection of two quadratic cones. The cell is the convex hull of the
circle in which the two quadrics meet, together with the two vertices.

For the Euclidean metric, the Voronoi boundary at a generic point $V$ in $X$ is defined by
an irreducible polynomial of degree 18 in $a, b, c$. This polynomial is found by the ideal
computation in equation (8.4). In some cases, the Voronoi degree can drop. For instance,
consider the special rank 1 matrix $V = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}$. For this point, the degree of the Voronoi
boundary is only 12. This particular Voronoi cell is shown on the right in Figure 8.4. This
cell is the convex hull of two ellipses, which are marked in red in the diagram. ⋄


Chapter 9
Condition Numbers


The concept of a condition number has its origin in numerical analysis. It measures how 
much the output value of a function we wish to evaluate can change for a small change in 
the input argument. In this chapter, we discuss condition numbers in the context of metric 
algebraic geometry. In the first section, we offer an introduction to the relevant notions for 
assessing errors in numerical computations. 

Suppose the function to be evaluated is an algebraic function. For instance, we may wish 
to map the coefficients of a univariate polynomial to one of its roots. This function is well- 
defined locally, and it is well-behaved when the polynomial is far from the hypersurface 
defined by the discriminant. Indeed, the condition number stands in a reciprocal relationship 
to the distance to the variety of ill-posed instances. This variety is often a discriminantal 
hypersurface, and thus we are naturally led to the ED problem for discriminants. This topic 
will be studied in the third and last sections of this chapter. The classical discriminant is the 
dual variety to the Veronese variety, and other discriminants are dual to other toric varieties. 
We can apply ED duality (Theorem 2.23) to gain insight and computational speed. 

A special case of a discriminant is the determinant of a square matrix. This arises when 
our function is matrix inversion. The relevant ED problem points us to the Eckart–Young
Theorem (Theorem 2.9). This is why we include the proof of Eckart–Young in the second
section, which is about the condition number of matrix inversion.


9.1 Errors in Numerical Computations 


Input data for numerical algorithms can have errors. These may be caused by measurement 
errors. Hence, the output of the computation also has errors. We wish to compare the output 
error with the input error. This is a fundamental issue for numerical computations. 


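Example 9.1 (Exact algorithm) Given a matrix $A \in \mathbb{R}^{2 \times 2}$ with $\det(A) \neq 0$, we want to
compute the inverse matrix $A^{-1}$. We consider two instances of this problem. The errors are
measured by the Euclidean norm.

(a) First, let $A = \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}$. We consider this matrix to be the true input data. A small
measurement error gives the new input data $\tilde{A} = \begin{pmatrix} 1 & 1 \\ -1 + \varepsilon & 1 \end{pmatrix}$, where $0 < \varepsilon < 1$. The
exact solutions $A^{-1}$ and $\tilde{A}^{-1}$ are then

$$ A^{-1} = \frac{1}{2} \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} \quad \text{and} \quad \tilde{A}^{-1} = A^{-1} + \frac{\varepsilon}{2(2 - \varepsilon)} \begin{pmatrix} 1 & -1 \\ -1 & 1 \end{pmatrix}. $$

Comparing the errors, we find that $\|A^{-1} - \tilde{A}^{-1}\| \approx \|A - \tilde{A}\|$. In words, the error in the
input $\|A - \tilde{A}\|$ and the error in the output $\|A^{-1} - \tilde{A}^{-1}\|$ are roughly the same for small
values of $\varepsilon$. They both are of the order $O(\varepsilon)$.

(b) The true input in our second example is the matrix $B = \begin{pmatrix} 1 & 1 \\ 1 & 1 + \delta \end{pmatrix}$, where $\delta \neq 0$ is
small. We perturb the input by adding $\varepsilon$ to the lower left entry. The perturbed input is
$\tilde{B} = \begin{pmatrix} 1 & 1 \\ 1 + \varepsilon & 1 + \delta \end{pmatrix}$. The matrix inverses are

$$ B^{-1} = \frac{1}{\delta} \begin{pmatrix} 1 + \delta & -1 \\ -1 & 1 \end{pmatrix} \quad \text{and} \quad \tilde{B}^{-1} = B^{-1} + \frac{\varepsilon}{\delta(\varepsilon - \delta)} \begin{pmatrix} -1 - \delta & 1 \\ 1 + \delta & -1 \end{pmatrix}. $$

This implies $\|B^{-1} - \tilde{B}^{-1}\| \approx \delta^{-2} \cdot \|B - \tilde{B}\|$. If $\varepsilon \ll \delta$, then the error is amplified by
a factor of roughly $\delta^{-2}$, which is large. The behavior here is different from that before.


We applied an exact algorithm to the problem of matrix inversion, and we observed
considerable differences in the output. In the first example, the output $\tilde{A}^{-1}$ for the perturbed
data $\tilde{A}$ was close to the true output $A^{-1}$. On the other hand, in the second example, the
output $\tilde{B}^{-1}$ for the perturbed data $\tilde{B}$ was far from the true output $B^{-1}$. ⋄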
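
The contrast in Example 9.1 can be reproduced with exact rational arithmetic, so that no rounding enters the inversion itself. The sketch below assumes the matrices $A = \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}$ and $B = \begin{pmatrix} 1 & 1 \\ 1 & 1 + \delta \end{pmatrix}$, each perturbed by $\varepsilon$ in the lower left entry; these entries are our reading of the example, and the values $\varepsilon = 10^{-6}$, $\delta = 10^{-3}$ are illustrative choices.

```python
from fractions import Fraction
from math import sqrt

def inverse2(m):
    """Exact inverse of a 2x2 matrix with rational entries."""
    a, b, c, d = (Fraction(x) for row in m for x in row)
    det = a * d - b * c
    return ((d / det, -b / det), (-c / det, a / det))

def frobenius_distance(m1, m2):
    """Euclidean (Frobenius) distance between two 2x2 matrices."""
    return sqrt(sum(float(Fraction(x) - Fraction(y)) ** 2
                    for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2)))

def amplification(m, m_tilde):
    """Output error divided by input error for matrix inversion."""
    output_error = frobenius_distance(inverse2(m), inverse2(m_tilde))
    return output_error / frobenius_distance(m, m_tilde)

eps = Fraction(1, 10 ** 6)    # size of the perturbation (illustrative)
delta = Fraction(1, 10 ** 3)  # closeness of B to a singular matrix (illustrative)

A = ((1, 1), (-1, 1))
A_tilde = ((1, 1), (-1 + eps, 1))
B = ((1, 1), (1, 1 + delta))
B_tilde = ((1, 1), (1 + eps, 1 + delta))

ratio_a = amplification(A, A_tilde)
ratio_b = amplification(B, B_tilde)
print(ratio_a, ratio_b, ratio_b * float(delta) ** 2)
```

With these values the first ratio is close to $1/2$, while the second is of order $\delta^{-2}$, that is, about two million.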


The previous example shows that, even if we can compute the exact solution of a 
problem, small errors in the data may be amplified tremendously in the output. The theory 
of condition numbers helps us to understand when and why this happens. In simple terms, 
a condition number is a quantity associated with a computational problem, and it measures 
the sensitivity of the output to small errors in the input data. 


Definition 9.2 A computational problem is a function $\phi : M \to N$ from a space $M$ of
inputs to a space $N$ of outputs. For us, each space is a subset of a Euclidean space, and it
carries the induced Euclidean metric.


Example 9.3 In Example 9.1, the input space and the output space are both given by
$M = N = \{A \in \mathbb{R}^{2 \times 2} \mid \det(A) \neq 0\}$. The computational problem is matrix inversion, so
the relevant function is $\phi(A) = A^{-1}$. ⋄


Let $(M, d_M)$ and $(N, d_N)$ be metric spaces. The following definition is due to Rice [152].


Definition 9.4 The (absolute) condition number of $\phi : M \to N$ at the input $p \in M$ is

$$ \kappa[\phi](p) := \lim_{\varepsilon \to 0}\ \sup_{d_M(p,q) \leq \varepsilon} \frac{d_N(\phi(p), \phi(q))}{d_M(p, q)}. $$

The motivation for this definition is as follows: For small $d_M(p, q)$ we have

$$ d_N(\phi(p), \phi(q)) \leq \kappa[\phi](p) \cdot d_M(p, q) + o(d_M(p, q)). $$

In words, a small error $\varepsilon = d_M(p, q)$ in the input data causes an error of roughly $\kappa[\phi](p) \cdot \varepsilon$
in the output data. This relationship is entirely independent of the algorithm that is used
to evaluate $\phi(p)$. At this stage, given the lim-sup definition, it is unclear how condition
numbers can be computed. As we shall see, this is where the geometric perspective of
Demmel [55] comes in. But, let us first discuss an important variant.


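Remark 9.5 Fix the Euclidean spaces $M = \mathbb{R}^n$ and $N = \mathbb{R}^m$. What does “small error” mean
in this case? If $\|p\| = 10^4$, is an error of size $\|p - q\| = 10^2$ small or large? To address
a question like this, there is a notion of relative error in numerical analysis. By definition,
the relative error between $p$ and $q$ is

$$ \mathrm{RelError}(p, q) := \frac{\|p - q\|}{\|p\|} \quad \text{for } p, q \in M. $$

The relative condition number of the function $\phi$ at the input datum $p \in M$ is as follows:

$$ \kappa_{\mathrm{REL}}[\phi](p) := \lim_{\varepsilon \to 0}\ \sup_{\mathrm{RelError}(p,q) \leq \varepsilon} \frac{\mathrm{RelError}(\phi(p), \phi(q))}{\mathrm{RelError}(p, q)} = \kappa[\phi](p) \cdot \frac{\|p\|}{\|\phi(p)\|}. $$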

In numerical analysis, relative errors are more significant than absolute errors because
floating-point arithmetic introduces relative errors (see, e.g., [89] or [173, p. 91]). Modern
architectures are optimized for computing with floating-point numbers. A floating-point
number system $F$ is a subset of the real numbers $\mathbb{R}$ that is specified by four integers
$\beta, t, e_{\min}, e_{\max}$, where $\beta$ is called base, $t$ is called precision, and $[e_{\min}, e_{\max}]$ is called
exponential range. Then, $F := \big\{ \pm \beta^e \sum_{i=1}^{t} \frac{d_i}{\beta^i} \ \big|\ 0 \leq d_i \leq \beta - 1,\ e_{\min} \leq e \leq e_{\max} \big\}$. The
quantity $u = \frac{1}{2} \beta^{1-t}$ is referred to as the relative precision of $F$.

The range of $F$ is the set $G := \{x \in \mathbb{R} \mid \beta^{e_{\min} - 1} \leq |x| \leq \beta^{e_{\max}} (1 - \beta^{-t})\}$. Note that
$F \subset G$. Consider the rounding function $\mathrm{fl} : \mathbb{R} \to F,\ x \mapsto \mathrm{argmin}_{y \in F} |x - y|$. One can
show that every $x \in G$ satisfies $\mathrm{fl}(x) = x(1 + \delta) \in F$ for some $\delta$ with $|\delta| \leq u$. This is a
crucial property. It tells us that every number in $G$ can be approximated by a number of $F$
with relative precision $u$. Namely, the following inequality holds:

$$ \mathrm{RelError}(x, \mathrm{fl}(x)) = \frac{|x - \mathrm{fl}(x)|}{|x|} \leq u \quad \text{for all } x \in G. \qquad (9.1) $$

Many hardware floating-point units use the IEEE 754 standard. This is a system $F$ with
the specifications

                  | $\beta$ | $t$ | $e_{\min}$ | $e_{\max}$      | $u$
  half (16 bit)   |   2     | 11  |   -14      | 16 = $2^4$      | $\approx 5 \cdot 10^{-4}$
  single (32 bit) |   2     | 24  |  -125      | 128 = $2^7$     | $\approx 6 \cdot 10^{-8}$
  double (64 bit) |   2     | 53  | -1021      | 1024 = $2^{10}$ | $\approx 10^{-16}$


This is designed so that the arithmetic operations $\circ \in \{+, -, \times, /, \sqrt{\ }\}$ satisfy the property
$\mathrm{fl}(x \circ y) = (x \circ y)(1 + \delta)$ for some $|\delta| \leq u$. For instance, the 64-bit floating-point number
system, specified in the third row of the table, can approximate any real number within its
range with a relative error of at most $u \approx 10^{-16}$.
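
Inequality (9.1) can be observed on a machine that uses IEEE 754 double precision, which is the case for Python floats on essentially all current platforms. The sketch below reads the base and precision from `sys.float_info`, forms $u = \frac{1}{2}\beta^{1-t}$ exactly, and measures the relative rounding error of a few rational numbers; the sample values are arbitrary choices.

```python
import sys
from fractions import Fraction

beta = sys.float_info.radix       # base, 2 for IEEE 754 binary formats
t = sys.float_info.mant_dig       # precision, 53 for double
u = Fraction(1, 2) * Fraction(beta) ** (1 - t)   # relative precision, exactly

def rounding_error(x):
    """RelError(x, fl(x)) for an exact rational x, computed without rounding."""
    x = Fraction(x)
    fl_x = Fraction(float(x))     # float(x) rounds to nearest; Fraction(float) is exact
    return abs(x - fl_x) / abs(x)

samples = [Fraction(1, 10), Fraction(1, 3), Fraction(2, 7), Fraction(10 ** 20 + 1, 3)]
errors = [rounding_error(x) for x in samples]

print(float(u))
print([float(e) <= float(u) for e in errors])
```

The value of `u` is $2^{-53} \approx 1.1 \cdot 10^{-16}$, in agreement with the third row of the table, and every measured error stays below it.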

After this digression into the practical aspects of numerical computing, we now return 
to the mathematical theory. The next theorem is also due to Rice [152]. In his article, M 
and N are arbitrary Riemannian manifolds. We here specialize to the algebraic setting. For 
us, each of M and N is a submanifold in a Euclidean space, described by a finite Boolean 
combination of polynomial equations and inequalities. 


Theorem 9.6 Fix a differentiable function $\phi : M \to N$. The condition number of this
computational problem at the input datum $p \in M$ is the maximal norm of the derivative
over the unit sphere in the tangent space at $p$; i.e.,

$$ \kappa[\phi](p) = \max_{t \in T_p M :\ \|t\| = 1} \|D_p \phi(t)\|. $$

We obtain the relative condition number $\kappa_{\mathrm{REL}}[\phi](p)$ by multiplying this with $\|p\| / \|\phi(p)\|$.


A key step in computing the condition number with Rice’s formula is to find an expression
for the Jacobian $D_p \phi$. For problems of interest to us, this step uses implicit
differentiation or geometric arguments.


Example 9.7 (Roots of univariate polynomials) Following [35, Section 14.1.1], we ex- 
amine the computational problem of finding one real root of a polynomial f in one variable 
z of degree d. We write 


FÈ = fot fiz+ fetes + faz’. 


The coefficient vector f = (fo, fi,..-, fa) now serves as the input p, and the output is a 
particular real number a which satisfies f(a) = 0. The function f + a( f) is well defined 
in a small open subset U of the coefficient space R¢*!. The set U must be small enough so 
that one root can be identified for each polynomial in U. 

To find the derivative $D_f a$ of our root-finding function $f \mapsto a(f)$, we assume that the
coefficients are differentiable functions $f_i(t)$ of a parameter $t$, and we set $\dot f_i = f_i'(0)$ and
$\dot f = \sum_i \dot f_i z^i$. Differentiating the identity $f(a(f)) = 0$ with respect to $t$ gives
$\dot f(a) + f'(a)\,\dot a = 0$, and we find the following formula for the desired derivative:

\[ D_f a \,=\, -\frac{\dot f(a)}{f'(a)}. \tag{9.2} \]

We now think of $a$ as a fixed root of a fixed polynomial $f(z)$. Theorem 9.6 implies that
the condition number $\kappa[a](f)$ equals $|f'(a)|^{-1}$ times the maximal value $|\dot f(a)|$, where $\dot f$
runs over all points $(\dot f_0, \dot f_1, \ldots, \dot f_d)$ on the unit $d$-sphere. The Cauchy-Schwarz inequality
implies that this maximum equals $|\dot f(a)| = \sqrt{\sum_{i=0}^d a^{2i}}$, and therefore we have the following
formula for the desired condition number:

\[ \kappa[a](f) \,=\, \frac{\sqrt{\sum_{i=0}^d a^{2i}}}{|f'(a)|}. \tag{9.3} \]

This quantity is $+\infty$ when $a$ is a double zero of $f$. The further away from being a double
zero, the smaller the condition number. The relative condition number for the univariate
root-finding problem equals

\[ \kappa_{\rm REL}[a](f) \,=\, \frac{\sqrt{\sum_{i=0}^d f_i^2}\; \sqrt{\sum_{i=0}^d a^{2i}}}{|a| \cdot |f'(a)|}. \]

This formula involves not just the polynomial $f(z)$ but also the zero $a$ we seek to find. This
is due to the fact that $a(f)$ is a local map, defined locally around the zero $a$; see [93]. ⋄
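
The two formulas of Example 9.7 are easy to evaluate numerically. The following is a minimal sketch; the function names and the two test polynomials $(z-1)(z-2)$ and $(z-1)(z-1.001)$ are our own illustrative choices. It computes $\kappa[a](f) = \sqrt{\sum_i a^{2i}}\,/\,|f'(a)|$ and the relative condition number at the root $a = 1$.

```python
import math

def poly_eval(coeffs, z):
    """Evaluate f(z) = sum_i coeffs[i] * z**i by Horner's rule."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * z + c
    return result

def poly_derivative(coeffs):
    """Coefficient list of f'(z), lowest degree first."""
    return [i * c for i, c in enumerate(coeffs)][1:]

def root_condition_numbers(coeffs, a):
    """Return (kappa, kappa_rel) for the root a of f, following (9.3)."""
    d = len(coeffs) - 1
    power_norm = math.sqrt(sum(a ** (2 * i) for i in range(d + 1)))
    fprime = abs(poly_eval(poly_derivative(coeffs), a))
    kappa = power_norm / fprime
    kappa_rel = kappa * math.sqrt(sum(c * c for c in coeffs)) / abs(a)
    return kappa, kappa_rel

f = [2.0, -3.0, 1.0]          # (z - 1)(z - 2), well separated roots
kappa, kappa_rel = root_condition_numbers(f, 1.0)

g_near = [1.001, -2.001, 1.0]  # (z - 1)(z - 1.001), nearly a double root
kappa_near, _ = root_condition_numbers(g_near, 1.0)

print(kappa, kappa_rel, kappa_near)
```

For the first polynomial the condition number is $\sqrt{3}$, while for the second it exceeds $10^3$, which illustrates the growth of $\kappa$ as the root approaches a double zero.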


Example 9.8 (The condition number of the ED minimization problem) We revisit
the ED minimization problem (2.1) for a smooth algebraic variety $X \subset \mathbb{R}^n$ from the
point of view of condition numbers. Recall the definition of the medial axis $\mathrm{Med}(X)$
from Chapter 7. Every point outside the medial axis has a unique closest point on $X$.
Formulating this as a computational problem, we have the input space $M = \mathbb{R}^n \setminus \mathrm{Med}(X)$
and the output space $N = X$. Here, $\phi : (\mathbb{R}^n \setminus \mathrm{Med}(X)) \to X$ is the closest-point function
$\phi(u) = \mathrm{argmin}_{x \in X} \|x - u\|$. We can apply Theorem 9.6 to compute the associated condition
number $\kappa[\phi](u)$. This was carried out in detail in [32]. Suppose $x = \phi(u)$. We have

\[ \kappa[\phi](u) \,=\, \max_{t \in T_xX :\, \|t\| = 1} \| (I_m - \lambda \cdot L_v)^{-1}\, t \|, \]

where $\lambda = \|u - x\|$ is the distance from $u$ to $X$, $v = \lambda^{-1}(u - x)$ is the normal vector at $x$
pointing towards $u$, and $L_v$ is the Weingarten map (6.10) of $X$ at $x$ in normal direction $v$. ⋄


9.2 Matrix Inversion and Eckart-Young 


In this section, we study the condition number for matrix inversion. This will lead us back 
to the Eckart-Young Theorem, for which we here present a proof. We begin by reviewing 
some norms on the space of real $m \times n$ matrices. The Euclidean norm (or Frobenius norm) is

\[ \|A\| \,:=\, \sqrt{\sum_{i,j} a_{ij}^2} \,=\, \sqrt{\mathrm{trace}(AA^\top)} \qquad \text{for } A \in \mathbb{R}^{m \times n}. \]

By contrast, in Theorem 9.6, we used the spectral norm, or operator norm. This is given by

\[ \|A\|_{\rm op} \,:=\, \max_{\|x\| = 1} \|Ax\|. \]

We have already met the spectral norm in Section 8.4. There, we introduced it as the
largest singular value of $A$. To see that this is an equivalent definition, we first observe
that, if $U$ and $V$ are orthogonal matrices, then $\|U^\top A V\|_{\rm op} = \|A\|_{\rm op}$. In words, the spectral
norm is orthogonally invariant. Suppose $m \ge n$. Let $A = UDV^\top$ be the singular value
decomposition of $A$, where $U \in \mathbb{R}^{m \times m}$ and $V \in \mathbb{R}^{n \times n}$ are orthogonal matrices and $D$ is
the $m \times n$ diagonal matrix with singular values $\sigma_1 \ge \cdots \ge \sigma_n \ge 0$ on the main diagonal.
Orthogonal invariance implies $\|A\|_{\rm op} = \|D\|_{\rm op} = \sigma_1$. Similarly, the Frobenius norm is
orthogonally invariant and hence satisfies $\|A\| = \|D\| = (\sigma_1^2 + \cdots + \sigma_n^2)^{1/2}$. Furthermore,
in the case where $n = m$ and $A$ is invertible, we have $A^{-1} = V D^{-1} U^\top$, so that

\[ \|A^{-1}\|_{\rm op} \,=\, \sigma_n^{-1}. \]


We now focus on the case of square matrices ($m = n$). Matrix inversion is the map

\[ \mathrm{inv} : \mathcal{D} \to \mathcal{D}, \quad A \mapsto A^{-1}, \]

where $\mathcal{D} = \{A \in \mathbb{R}^{n \times n} \mid \det(A) \neq 0\}$. We shall prove the following characterization of the
condition number of matrix inversion. For any $A \in \mathcal{D}$, the smallest singular value $\sigma_n$ is a
positive real number.


Theorem 9.9 The condition number of matrix inversion at $A \in \mathcal{D}$ is

\[ \kappa[\mathrm{inv}](A) \,=\, \|A^{-1}\|_{\rm op}^2 \,=\, \sigma_n^{-2}. \]


Before we prove this theorem, let us briefly bring it into context with Remark 9.5. In 
numerical analysis, a popular choice for measuring the relative error is using the spectral 
norm. In this case, by Theorem 9.9, the relative condition number can be expressed as the 
ratio of the largest and smallest singular value of A: 


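\[ \kappa_{\rm REL}[\mathrm{inv}](A) \,=\, \kappa[\mathrm{inv}](A) \cdot \frac{\|A\|_{\rm op}}{\|A^{-1}\|_{\rm op}} \,=\, \frac{\sigma_1}{\sigma_n}. \tag{9.4} \]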


This ratio is Turing’s condition number and goes back to the work of Turing [174]. The 
meaning of (9.4) for Voronoi cells of determinantal varieties was discussed in Section 8.4. 


Proof (of Theorem 9.9) Let $\mathrm{adj}(A)$ denote the adjoint matrix of $A$. Since

\[ \mathrm{inv}(A) \,=\, A^{-1} \,=\, \frac{1}{\det(A)} \cdot \mathrm{adj}(A), \]

the map inv is a polynomial function on the open set $\mathcal{D}$. In particular, inv is differentiable
on $\mathcal{D}$. By Theorem 9.6, the condition number is $\kappa[\mathrm{inv}](A) = \|D_A \mathrm{inv}\|_{\rm op}$. We compute
the derivative of inv. Taking the derivative of $AB = I$, we have $\dot A B + A \dot B = 0$. Since
$B = A^{-1}$, we conclude $\dot B = -A^{-1} \dot A A^{-1}$. For a tangent vector $\dot A = R \in \mathbb{R}^{n \times n}$ we therefore
have that $D_A \mathrm{inv}(R) = -A^{-1} R A^{-1}$. Let $A = UDV^\top$ be the singular value decomposition
of $A$. Then, $A^{-1} = V D^{-1} U^\top$. By orthogonal invariance of the Euclidean norm, we find

\[ \max_{\|R\| = 1} \|A^{-1} R A^{-1}\|^2 \,=\, \max_{\|R\| = 1} \|D^{-1} R D^{-1}\|^2 \,=\, \max_{\|R\| = 1} \sum_{i,j} \frac{r_{ij}^2}{\sigma_i^2 \sigma_j^2}. \]

The last expression is maximized when $r_{nn} = 1$ and all other $r_{ij}$ are zero. We conclude
that $\kappa[\mathrm{inv}](A) = \max_{\|R\| = 1} \|D_A \mathrm{inv}(R)\| = \sigma_n^{-2} = \|A^{-1}\|_{\rm op}^2$ is the condition number of
matrix inversion at $A$. □

Example 9.10 We revisit the matrices $A$ and $B$ from Example 9.1.
To be concrete, we set $\delta = 10^{-8}$. Since $A^\top A = 2 \cdot I_2$, we have that $\|A^{-1}x\| = \frac{1}{\sqrt{2}} \|x\|$ for
all $x \in \mathbb{R}^2$. By Theorem 9.9, $\kappa[\mathrm{inv}](A) = \|A^{-1}\|_{\rm op}^2 = \frac{1}{2}$. On the other hand, we have
$\|B^{-1}\|_{\rm op} \ge \|B^{-1}e_2\| \ge 10^8$, and this implies $\kappa[\mathrm{inv}](B) = \|B^{-1}\|_{\rm op}^2 \ge 10^{16}$. This explains
the different behaviors of the outputs with respect to errors in the input in Example 9.1. ⋄
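
The contrast between a well-conditioned and an ill-conditioned matrix can be reproduced with a few lines of code. The following is a minimal sketch for $2 \times 2$ matrices. The two matrices below are assumptions: they are chosen to satisfy the properties stated in the example ($A^\top A = 2 \cdot I_2$, and $B$ nearly singular with $\delta = 10^{-8}$), and they are not necessarily the matrices of Example 9.1. The singular values are obtained from $\sigma_1^2 + \sigma_2^2 = \|M\|^2$ and $\sigma_1 \sigma_2 = |\det(M)|$.

```python
import math

def singular_values_2x2(M):
    """Return (sigma_1, sigma_2) of a real 2x2 matrix, with sigma_1 >= sigma_2 >= 0."""
    (a, b), (c, d) = M
    frob_sq = a * a + b * b + c * c + d * d      # sigma_1^2 + sigma_2^2
    det = a * d - b * c                          # |det| = sigma_1 * sigma_2
    disc = max(frob_sq * frob_sq - 4.0 * det * det, 0.0)
    sigma1 = math.sqrt((frob_sq + math.sqrt(disc)) / 2.0)
    sigma2 = abs(det) / sigma1                   # avoids cancellation for tiny sigma_2
    return sigma1, sigma2

def kappa_inv(M):
    """Condition number of inversion as in Theorem 9.9: 1 / sigma_min^2."""
    _, sigma2 = singular_values_2x2(M)
    return 1.0 / sigma2 ** 2

def kappa_rel_inv(M):
    """Relative condition number (9.4): sigma_max / sigma_min."""
    sigma1, sigma2 = singular_values_2x2(M)
    return sigma1 / sigma2

delta = 1e-8
A = [[1.0, 1.0], [-1.0, 1.0]]          # assumed: satisfies A^T A = 2 * I
B = [[1.0, 1.0], [1.0, 1.0 + delta]]   # assumed: nearly singular

print(kappa_inv(A), kappa_rel_inv(A))
print(kappa_inv(B), kappa_rel_inv(B))
```

With these choices, $\kappa[\mathrm{inv}](A) = 1/2$, whereas $\kappa[\mathrm{inv}](B)$ is larger than $10^{16}$.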


The Eckart-Young Theorem (Theorem 2.9) yields a metric interpretation of Turing's
condition number $\kappa[\mathrm{inv}](A)$ from Theorem 9.9. The smallest singular value $\sigma_n$ of a square
matrix $A$ equals the Euclidean distance of $A$ to the variety of singular matrices. This is the
hypersurface defined by the determinant:

\[ \Sigma \,:=\, \{A \in \mathbb{R}^{n \times n} \mid \det(A) = 0\}. \]

Turing's condition number is the squared inverse distance from $A$ to the hypersurface $\Sigma$:

\[ \kappa[\mathrm{inv}](A) \,=\, \frac{1}{\mathrm{dist}(A, \Sigma)^2} \qquad \text{and} \qquad \kappa_{\rm REL}[\mathrm{inv}](A) \,=\, \frac{\|A\|_{\rm op}}{\mathrm{dist}(A, \Sigma)}. \tag{9.5} \]


Example 9.11 Fix the $2 \times 2$-determinant $\Sigma = V(\det) \subset \mathbb{R}^{2 \times 2}$ and consider the set of matrices
$A \in \mathbb{R}^{2 \times 2}$ for which $\kappa[\mathrm{inv}](A) = \varepsilon^{-1}$. This is the offset hypersurface of $\Sigma$ at level $\sqrt{\varepsilon}$.
We can compute this as in Example 7.14. In what follows we compute the hypersurface
defined by $\kappa_{\rm REL}[\mathrm{inv}](A) = \varepsilon^{-1}$ for $\varepsilon > 0$. To make the formula in (9.5) more convenient,
we replace $\|A\|_{\rm op}$ by $\|A\|$. We proceed as in Section 7.2 to compute a polynomial equation
for $\mathrm{dist}(A, \Sigma) = \varepsilon \cdot \|A\|$ in terms of $A$ and $\varepsilon$, but for $B \in \Sigma$ we replace the affine sphere
$\|A - B\| = \varepsilon$ by the homogeneous sphere $\|A - B\| = \varepsilon \cdot \|A\|$. We use Macaulay2 [73]:


R = QQ[a_0..a_3, b_0..b_3, eps];
-- f cuts out the singular matrices B = (b_0 b_1; b_2 b_3)
f = b_0*b_3 - b_1*b_2;
-- squared Frobenius norm of A
nAsq = a_0^2 + a_1^2 + a_2^2 + a_3^2;
-- homogeneous sphere ||A - B||^2 = eps^2 * ||A||^2
d = (a_0-b_0)^2+(a_1-b_1)^2+(a_2-b_2)^2+(a_3-b_3)^2-eps^2*nAsq;

J1 = {diff(b_0, f), diff(b_1, f), diff(b_2, f), diff(b_3, f)};
J2 = {diff(b_0, d), diff(b_1, d), diff(b_2, d), diff(b_3, d)};
J = matrix {J1, J2};

-- critical equations for B, followed by elimination of the b-variables
OC = ideal {f, minors(2, J), d};
O = eliminate({b_0, b_1, b_2, b_3}, OC);
g = sub((gens O)_0_0, QQ[a_0..a_3, b_0..b_3][eps])


The result of this computation is a polynomial of the form

\[ g(a, \varepsilon) \,=\, (a_0^2 + a_1^2 + a_2^2 + a_3^2) \cdot \bigl( g_0(a) + g_1(a)\varepsilon^2 + g_2(a)\varepsilon^4 + g_3(a)\varepsilon^6 + g_4(a)\varepsilon^8 \bigr). \]

The coefficients of this relative offset polynomial are

\[
\begin{aligned}
g_4(a) &= (a_0^2 + a_1^2 + a_2^2 + a_3^2)^2, \\
g_3(a) &= -3\,(a_0^2 + a_1^2 + a_2^2 + a_3^2)^2, \\
g_2(a) &= 3a_0^4 + 6a_0^2a_1^2 + 3a_1^4 + 6a_0^2a_2^2 + 7a_1^2a_2^2 + 3a_2^4 - 2a_0a_1a_2a_3 + 7a_0^2a_3^2 + 6a_1^2a_3^2 + 6a_2^2a_3^2 + 3a_3^4, \\
g_1(a) &= -\bigl(a_0^4 + 2a_0^2a_1^2 + a_1^4 + 2a_0^2a_2^2 + 4a_1^2a_2^2 + a_2^4 - 4a_0a_1a_2a_3 + 4a_0^2a_3^2 + 2a_1^2a_3^2 + 2a_2^2a_3^2 + a_3^4\bigr), \\
g_0(a) &= (a_1a_2 - a_0a_3)^2.
\end{aligned}
\]

Figure 9.1 shows the zero set of $g(a, \varepsilon)$ at level $\varepsilon = 0.5$ in the affine patch $a_0 = 1$. We
remark that, if $\mathrm{dist}(A, \Sigma) = \varepsilon \cdot \|A\|$, then $\varepsilon = \sin \alpha$, where $\alpha$ is the minimal angle between
the line $\mathbb{R} \cdot A$ and a line $\mathbb{R} \cdot B$ with $B \in \Sigma$. In this case, $\varepsilon^{-1}$ is also called a conic condition
number (see [35, Chapters 20 & 21]). ⋄
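
The relative offset polynomial can be sanity-checked on a concrete matrix. The following is a minimal sketch in exact rational arithmetic. It uses the observation (which one can confirm by expanding) that the five coefficients can be written in terms of $N = a_0^2 + a_1^2 + a_2^2 + a_3^2$ and $\Delta^2 = (a_1a_2 - a_0a_3)^2$, namely $g_4 = N^2$, $g_3 = -3N^2$, $g_2 = 3N^2 + \Delta^2$, $g_1 = -(N^2 + 2\Delta^2)$ and $g_0 = \Delta^2$. The test matrix $\mathrm{diag}(1, 2)$ and the function names are our own choices.

```python
from fractions import Fraction

def offset_coefficients(a):
    """Return [g_0, g_1, g_2, g_3, g_4] evaluated at a = (a_0, a_1, a_2, a_3)."""
    a0, a1, a2, a3 = a
    N = a0**2 + a1**2 + a2**2 + a3**2     # squared Frobenius norm of A
    D = (a1 * a2 - a0 * a3) ** 2          # squared determinant, equals g_0
    return [D, -(N**2 + 2 * D), 3 * N**2 + D, -3 * N**2, N**2]

def inner_factor(a, eps_sq):
    """Evaluate g_0 + g_1 eps^2 + g_2 eps^4 + g_3 eps^6 + g_4 eps^8."""
    return sum(g * eps_sq**k for k, g in enumerate(offset_coefficients(a)))

# A = diag(1, 2): singular values 2 and 1, squared Frobenius norm 5.
a = (1, 0, 0, 2)
eps_sq_small = Fraction(1, 5)   # (sigma_2 / ||A||)^2, the distance to Sigma
eps_sq_large = Fraction(4, 5)   # (sigma_1 / ||A||)^2, the other critical distance
eps_sq_other = Fraction(1, 2)   # a level that is not a critical distance

print(inner_factor(a, eps_sq_small))
print(inner_factor(a, eps_sq_large))
print(inner_factor(a, eps_sq_other))
```

In this example the inner factor vanishes at both critical levels $\varepsilon^2 = \sigma_2^2/\|A\|^2$ and $\varepsilon^2 = \sigma_1^2/\|A\|^2$, and it is nonzero at the third level.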


Fig. 9.1: This picture shows the determinantal hypersurface $\Sigma = \{\det(A) = 0\} \subset \mathbb{R}^{2 \times 2}$ (the red-blue
surface in the middle) together with the hypersurface defined by $\mathrm{dist}(A, \Sigma) = \varepsilon \cdot \|A\|$ with $\varepsilon = 0.5$ (the
union of the two green-yellow surfaces on the outside). The picture is drawn in the affine patch where the
upper left entry of $A$ is fixed to be one.


In the remainder of this section, we give a proof of the Eckart-Young Theorem. For this,
we return now to rectangular matrices and denote by $X_r := \{A \in \mathbb{R}^{m \times n} \mid \mathrm{rank}(A) \le r\}$ the
variety of real matrices of rank at most $r$. Recall that $\mathrm{Sing}(X_r) = X_{r-1}$. We first compute
the normal space of $X_r$ at a smooth point. Lemma 9.12 can be viewed as a variant of
Example 2.22, but presented in a more down-to-earth manner.


Lemma 9.12 Fix $A \in X_r$ of rank $r$. Suppose that $A = RS^\top$, where $R \in \mathbb{R}^{m \times r}$ and $S \in \mathbb{R}^{n \times r}$
have rank $r$. The normal space of $X_r$ at $A$ has dimension $(m - r)(n - r)$ and it equals

\[ N_A X_r \,=\, \mathrm{span}\{\, uv^\top \mid u^\top R = 0 \text{ and } S^\top v = 0 \,\}. \]


Proof Let $R(t) \in \mathbb{R}^{m \times r}$ and $S(t) \in \mathbb{R}^{n \times r}$ be smooth curves with $R(0) = R$ and $S(0) = S$.
Then, $\gamma(t) := R(t)S(t)^\top$ is a smooth curve with $\gamma(0) = A$. By the product rule from calculus,

\[ \frac{\partial}{\partial t} \gamma(t) \Big|_{t=0} \,=\, R \cdot \Bigl( \frac{\partial}{\partial t} S(t) \Big|_{t=0} \Bigr)^{\!\top} + \Bigl( \frac{\partial}{\partial t} R(t) \Big|_{t=0} \Bigr) \cdot S^\top. \tag{9.6} \]

Let $V := \{RP^\top \mid P \in \mathbb{R}^{n \times r}\}$ and $W := \{QS^\top \mid Q \in \mathbb{R}^{m \times r}\}$. The equation (9.6) shows
that the tangent space of $X_r$ at $A$ is the sum $T_A X_r = V + W$. (Aside for students: what
is the intersection $V \cap W$?) We note that $V$ consists of all matrices $L \in \mathbb{R}^{m \times n}$ such that
$u^\top L x = 0$ for all $u$ with $u^\top R = 0$ and $x \in \mathbb{R}^n$ arbitrary. Since $u^\top L x = \mathrm{Trace}(L^\top u x^\top)$, this
shows that the normal space of $V$ is spanned by matrices of the form $ux^\top$. Similarly, the
normal space of $W$ is spanned by $yv^\top$, where $S^\top v = 0$ and $y \in \mathbb{R}^m$ arbitrary. Therefore,
the normal space of $T_A X_r = V + W$ is spanned by all $uv^\top$ with $u$ and $v$ as above. □


Corollary 9.13 $\dim X_r = nm - (m - r)(n - r) = r(m + n - r)$.

We use Lemma 9.12 to prove the Eckart-Young Theorem.


Proof (of Theorem 2.9) Let $B \in X_r$ be a matrix of rank $r$. We consider a singular value
decomposition $B = UDV^\top$. The matrices $U \in \mathbb{R}^{m \times r}$ and $V \in \mathbb{R}^{n \times r}$ have orthonormal
columns, and $D = \mathrm{diag}(\sigma_1, \ldots, \sigma_r)$ with $\sigma_1, \ldots, \sigma_r > 0$ (not necessarily ordered). To
derive Theorem 2.9, we shall prove a claim that is reminiscent of Theorem 8.24. Namely,
the singular value decomposition of all matrices $A \in \mathbb{R}^{m \times n}$ such that $B$ is an ED critical
point for $A$ has the form $A = [\,U \ U'\,] \begin{bmatrix} D & 0 \\ 0 & D' \end{bmatrix} [\,V \ V'\,]^\top$, where $D' = \mathrm{diag}(\sigma_{r+1}, \ldots, \sigma_n)$.
By orthogonal invariance, we can assume that $B = \begin{bmatrix} D & 0 \\ 0 & 0 \end{bmatrix}$. Let $A \in B + N_B X_r$ be a
matrix in the normal space of $B$. By Lemma 9.12, we have $A = B + \sum_{i=r+1}^{m} \sum_{j=r+1}^{n} a_{ij}\, e_i e_j^\top$
for some coefficients $a_{ij} \in \mathbb{R}$, i.e.

\[ A \,=\, \begin{bmatrix} D & 0 \\ 0 & A' \end{bmatrix}, \qquad \text{where } A' = (a_{ij}) \in \mathbb{R}^{(m-r) \times (n-r)}. \]

Let now $A' = U'D'V'^\top$ be the singular value decomposition of $A'$. Then,

\[ A \,=\, \begin{bmatrix} I_r & 0 \\ 0 & U' \end{bmatrix} \begin{bmatrix} D & 0 \\ 0 & D' \end{bmatrix} \begin{bmatrix} I_r & 0 \\ 0 & V' \end{bmatrix}^{\!\top}. \tag{9.7} \]

This is the desired singular value decomposition of the $m \times n$ matrix $A$. □


Remark 9.14 Our proof of the Eckart-Young Theorem also works for symmetric matrices.
It implies that the rank $r$ matrix $B$, which minimizes the distance to a symmetric matrix $A$,
is also symmetric.

The formula in (9.5) states that the condition number of matrix inversion at the input
$A \in \mathbb{R}^{n \times n}$ is the inverse squared distance to the variety of singular matrices $\Sigma$. To a numerical
analyst, the elements of a set like $\Sigma$ are the ill-posed inputs of the computational problem. A
result that connects a condition number and an inverse distance to ill-posed inputs is called
a condition number theorem. Equation (9.5) yields a condition number theorem for matrix
inversion. Condition number theorems were derived in [176] for computing eigenvalues of
matrices, and in [93] for computing zeros of polynomials, as in Example 9.7.


9.3 Condition Number Theorems

Condition number theorems connect metric algebraic geometry with numerical analysis.
They relate the numerical difficulty of an input datum $p$ to the distance of $p$ to the locus of
ill-posed inputs. This is explained in detail in Demmel's paper [55]. The determinant of a square
matrix is a special case of a discriminant, and $\Sigma$ will generally be a hypersurface. This will
be made precise in the next section. In this section, we prove a condition number theorem
for solving systems of polynomial equations. We consider homogeneous polynomials and
their zeros in projective space. This differs from Example 9.7, where $a$ was a zero on
the affine line. While this difference may seem insignificant to an algebraic geometer, it
becomes significant in our metric setting. Namely, affine space with its Euclidean metric and
projective space with the metric described below are markedly different as metric spaces.

We fix the real projective space $\mathbb{P}^n_{\mathbb{R}}$ and we write $\mathcal{H}_d$ for the vector space of homogeneous
polynomials of degree $d$ in $x = (x_0, \ldots, x_n)$. For $m \le n$ and $\mathbf{d} = (d_1, \ldots, d_m)$ we abbreviate

\[ \mathcal{H}_{\mathbf{d}} \,:=\, \mathcal{H}_{d_1} \times \cdots \times \mathcal{H}_{d_m}. \]

Suppose $m = n$, let $F \in \mathcal{H}_{\mathbf{d}}$ and let $a \in \mathbb{P}^n_{\mathbb{R}}$ be a regular zero of $F$. By the Implicit Function
Theorem, there is a locally defined solution function $a : U \to V$, where $U \subset \mathcal{H}_{\mathbf{d}}$ is a
neighborhood of $F$ and $V \subset \mathbb{P}^n_{\mathbb{R}}$ is a neighborhood of $a$, such that $F(a(F)) = 0$. Our goal
is to give a formula for the condition number of the solution function $a$.

For this, we first need to introduce metrics on the space of polynomials and the space
of zeros. Since $a$ is defined locally, we can replace projective space $\mathbb{P}^n_{\mathbb{R}}$ by the $n$-sphere
$S^n \subset \mathbb{R}^{n+1}$. The latter carries the natural metric inherited from its ambient space. In fact,
we can use the Riemannian metric on the sphere to define a metric on $\mathbb{P}^n_{\mathbb{R}}$, so that it becomes
a Riemannian manifold (this approach is worked out in Section 12.3 for complex projective
space). In this way, $\mathbb{P}^n_{\mathbb{R}}$ becomes curved, while affine space is flat.

For $\mathcal{H}_d$, we use the Bombieri-Weyl metric. Let $A := \{\alpha \in \mathbb{N}^{n+1} \mid \alpha_0 + \cdots + \alpha_n = d\}$. The
Bombieri-Weyl inner product between two homogeneous polynomials $f = \sum_{\alpha \in A} f_\alpha x^\alpha$
and $g = \sum_{\alpha \in A} g_\alpha x^\alpha$ in $\mathcal{H}_d$ is defined by the formula

\[ \langle f, g \rangle_{\rm BW} \,:=\, \sum_{\alpha \in A} \frac{\alpha_0! \cdots \alpha_n!}{d!} \, f_\alpha \, g_\alpha. \tag{9.8} \]

For $d = 1$, this is the usual Euclidean inner product in $\mathbb{R}^{n+1}$.
The reason for the multinomial coefficients in (9.8) is that the Bombieri-Weyl inner
product is orthogonally invariant: If $U \in O(n + 1)$ is an orthogonal matrix, then

\[ \langle f \circ U, g \circ U \rangle_{\rm BW} \,=\, \langle f, g \rangle_{\rm BW}. \]

Kostlan [110, 111] showed that (9.8) is the unique (up to scaling) orthogonally invariant
inner product on $\mathcal{H}_d$ such that monomials are pairwise orthogonal. This inner product
extends to $\mathcal{H}_{\mathbf{d}}$. Namely, for tuples $F = (f_1, \ldots, f_m)$ and $G = (g_1, \ldots, g_m)$, we define

\[ \langle F, G \rangle_{\rm BW} \,:=\, \langle f_1, g_1 \rangle_{\rm BW} + \cdots + \langle f_m, g_m \rangle_{\rm BW}. \]

The Bombieri-Weyl norm is $\|F\|_{\rm BW} := \sqrt{\langle F, F \rangle_{\rm BW}}$, and the corresponding distance is
$\mathrm{dist}_{\rm BW}(F, G) := \|F - G\|_{\rm BW}$ for $F, G \in \mathcal{H}_{\mathbf{d}}$.

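Example 9.15 ($n = 1$, $d = 2$) Consider two quadrics $f(x_0, x_1) = a x_0^2 + b x_0 x_1 + c x_1^2 = x^\top A x$
and $g(x_0, x_1) = \alpha x_0^2 + \beta x_0 x_1 + \gamma x_1^2 = x^\top B x$, where $x = (x_0, x_1)^\top$, with symmetric matrices

\[ A = \begin{bmatrix} a & b/2 \\ b/2 & c \end{bmatrix} \qquad \text{and} \qquad B = \begin{bmatrix} \alpha & \beta/2 \\ \beta/2 & \gamma \end{bmatrix}. \]

The inner product of the two quadratic forms is the trace inner product of the two matrices:

\[ \langle f, g \rangle_{\rm BW} \,=\, \frac{2! \cdot 0!}{2!}\, a\alpha + \frac{1! \cdot 1!}{2!}\, b\beta + \frac{0! \cdot 2!}{2!}\, c\gamma \,=\, a\alpha + \frac{1}{2}\, b\beta + c\gamma \,=\, \mathrm{Trace}(A^\top B). \]

An analogous formula holds for quadrics in more than two variables. Thus, the Bombieri-Weyl
product generalizes the trace inner product for symmetric matrices to homogeneous
polynomials of any degree. ⋄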
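
The Bombieri-Weyl inner product is straightforward to implement. The following is a minimal sketch, assuming that a homogeneous polynomial is stored as a dictionary from exponent tuples $\alpha$ to coefficients $f_\alpha$; this representation and the function name are our own choices. It reproduces the identity $\langle f, g \rangle_{\rm BW} = a\alpha + \frac{1}{2} b\beta + c\gamma$ for two binary quadrics.

```python
from fractions import Fraction
from math import factorial

def bombieri_weyl(f, g):
    """Bombieri-Weyl inner product of two forms of the same degree.

    f and g are dicts mapping exponent tuples alpha to coefficients.
    """
    total = Fraction(0)
    for alpha, f_alpha in f.items():
        if alpha in g:
            weight = 1
            for k in alpha:
                weight *= factorial(k)          # alpha_0! * ... * alpha_n!
            d = sum(alpha)                      # degree of the monomial
            total += Fraction(weight, factorial(d)) * f_alpha * g[alpha]
    return total

# f = x0^2 + 2 x0 x1 + 3 x1^2 and g = 4 x0^2 + 5 x0 x1 + 6 x1^2
f = {(2, 0): 1, (1, 1): 2, (0, 2): 3}
g = {(2, 0): 4, (1, 1): 5, (0, 2): 6}
print(bombieri_weyl(f, g))            # equals 1*4 + (2*5)/2 + 3*6

# Squared norm of the Fermat cubic x0^3 + x1^3 + x2^3
fermat = {(3, 0, 0): 1, (0, 3, 0): 1, (0, 0, 3): 1}
print(bombieri_weyl(fermat, fermat))
```

The first value agrees with the trace of the product of the two symmetric matrices from the example, and the second confirms that the Fermat cubic has squared Bombieri-Weyl norm 3.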


We now turn to the condition number of the solution function $a : U \to V$. The zero
$a = a(F)$ is represented by a unit vector in $S^n$. Proceeding as in Example 9.7, we differentiate
the equation $F(a(F)) = 0$. This gives $JF(a)\,\dot a + \dot F(a) = 0$. Here, $JF(a) = \bigl(\frac{\partial f_i}{\partial x_j}(a)\bigr) \in \mathbb{R}^{n \times (n+1)}$
is the Jacobian matrix of $F$ at $a$, and $\dot a \in T_aS^n$. The tangent space $T_aS^n$ is the image of the
projection matrix $P_a := I_{n+1} - aa^\top$. Note that $\det(P_a) = 1 - \|a\|^2 = 0$. When $a$ is a regular
zero of $F$, the rank of $JF(a)$ is $n$, and we obtain

\[ \dot a \,=\, -\bigl(JF(a)|_{T_aS^n}\bigr)^{-1} \dot F(a). \]

This formula is a multivariate version of (9.2). By Theorem 9.6, the condition number
for $a$ is $\kappa[a](F) = \max_{\|\dot F\|_{\rm BW} = 1} \|(JF(a)|_{T_aS^n})^{-1} \dot F(a)\|$. We use orthogonal invariance of
the Bombieri-Weyl metric to compute this maximum. Fix $U \in O(n + 1)$ such that $a = Ue_0$,
where $e_0 = (1, 0, \ldots, 0) \in S^n$. We have $\|\dot F\|_{\rm BW} = \|\dot F \circ U\|_{\rm BW}$ for all $\dot F \in \mathcal{H}_{\mathbf{d}}$. This implies
that $\kappa[a](F)$ is also obtained by maximizing $\|(JF(a)|_{T_aS^n})^{-1} \dot F(e_0)\|$ over all $\dot F \in \mathcal{H}_{\mathbf{d}}$ with
$\|\dot F\|_{\rm BW} = 1$. Let us write $\dot F(x) = x_0^d \cdot b + G(x)$, where $b \in \mathbb{R}^n$ and $G$ involves only powers
of $x_0$ of degree less than $d$. Then, $\dot F(e_0) = b$ and so $\|\dot F(e_0)\| = \|b\| = \sqrt{1 - \|G\|_{\rm BW}^2}$.
This uses the fact that $x_0^d \cdot b$ and $G$ are orthogonal for the Bombieri-Weyl inner product.
We have thus shown that $\{\dot F(a) \mid \|\dot F\|_{\rm BW} = 1\}$ is the unit ball in $\mathbb{R}^n$. Consequently,

\[ \kappa[a](F) \,=\, \bigl\| (JF(a)|_{T_aS^n})^{-1} \bigr\|_{\rm op}. \tag{9.9} \]


Corollary 9.16 The condition number for solving $n$ polynomial equations on $\mathbb{P}^n_{\mathbb{R}}$ equals

\[ \kappa[a](F) \,=\, \frac{1}{\sigma_n(JF(a)\, P_a)}. \tag{9.10} \]

Proof The matrix $JF(a)P_a$ represents the linear map $JF(a)|_{T_aS^n}$. Hence, the two have the
same singular values. □

Example 9.17 (Univariate polynomials revisited) We examine our formula for $n = 1$. The
computation takes a binary form $f(x_0, x_1)$ to one of its zeros $a = (a_0, a_1)$ in $S^1$. This is the
projective version of Example 9.7. The Jacobian equals $Jf(a) = [\,\partial f/\partial x_0(a) \ \ \partial f/\partial x_1(a)\,]$.
Equation (9.10) yields the following analogue to Equation (9.3):

\[ \kappa[a](f) \,=\, \frac{1}{\sqrt{(\partial f/\partial x_0(a))^2 + (\partial f/\partial x_1(a))^2}}. \]

Here, $a$ is a point on the circle $S^1$, representing a zero of $f$ in $\mathbb{P}^1_{\mathbb{R}}$. ⋄
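
The formula for binary forms can be evaluated in a few lines. The following is a minimal sketch, assuming that a binary form of degree $d$ is stored as a list whose entry $i$ is the coefficient of $x_0^{d-i} x_1^{i}$; this convention, the function names, and the test form $x_0^2 - x_1^2$ are our own choices.

```python
import math

def binary_form_gradient(coeffs, x0, x1):
    """Gradient of f = sum_i coeffs[i] * x0**(d-i) * x1**i at the point (x0, x1)."""
    d = len(coeffs) - 1
    dx0 = sum((d - i) * c * x0 ** (d - i - 1) * x1 ** i
              for i, c in enumerate(coeffs) if i < d)
    dx1 = sum(i * c * x0 ** (d - i) * x1 ** (i - 1)
              for i, c in enumerate(coeffs) if i > 0)
    return dx0, dx1

def kappa_binary(coeffs, a):
    """Condition number of a zero a on the unit circle: 1 / ||grad f(a)||."""
    g0, g1 = binary_form_gradient(coeffs, a[0], a[1])
    return 1.0 / math.hypot(g0, g1)

f = [1.0, 0.0, -1.0]                          # x0^2 - x1^2
a = (1.0 / math.sqrt(2.0), 1.0 / math.sqrt(2.0))   # zero (1 : 1), scaled to S^1
kappa = kappa_binary(f, a)
print(kappa)
```

Here the gradient at $a$ is $(\sqrt{2}, -\sqrt{2})$, so the condition number equals $1/2$.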


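We now return to the general case $m \le n$, i.e. we allow systems with fewer equations.


Proposition 9.18 For $x \in \mathbb{P}^n_{\mathbb{R}}$, denote $\Sigma(x) := \{F \in \mathcal{H}_{\mathbf{d}} \mid F(x) = 0 \text{ and } \mathrm{rank}\, JF(x) < m\}$
and set $D = \mathrm{diag}(d_1, \ldots, d_m)$. The distance from $F$ to the discriminant $\Sigma(x)$ equals

\[ \mathrm{dist}_{\rm BW}(F, \Sigma(x)) \,=\, \sqrt{\|F(x)\|^2 + \sigma_m\bigl(D^{-1/2}\, JF(x)\, P_x\bigr)^2}. \]

Hence, if $a$ is a zero of the system $F$, then we have $\mathrm{dist}_{\rm BW}(F, \Sigma(a)) = \sigma_m(D^{-1/2}\, JF(a)\, P_a)$.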


Proof Let $U \in O(n + 1)$ with $x = Ue_0$ and set $F_0 := F \circ U$. By orthogonal invariance,
$\mathrm{dist}_{\rm BW}(F, \Sigma(x)) = \mathrm{dist}_{\rm BW}(F_0, \Sigma(e_0))$. We shall compute the latter. We write

\[ F_0(x) \,=\, x_0^d \cdot b + x_0^{d-1} \cdot B\,(x_1, \ldots, x_n)^\top + H(x), \]

where $b \in \mathbb{R}^m$, $B \in \mathbb{R}^{m \times n}$ and $H$ has degree $\le d - 2$ in $x_0$. Note that $b = F_0(e_0) = F(x)$
and $B = JF_0(e_0)\, P_{e_0} = JF(x)\, P_x\, U^\top$. Recall that $G \in \Sigma(e_0)$ if and only if $G(e_0) = 0$ and
the Jacobian $JG(e_0)$ has rank at most $m - 1$. This means that the distance from $\Sigma(e_0)$ to $F_0$
is minimized at a polynomial system of the form $G_0(x) = x_0^{d-1} \cdot A\,(x_1, \ldots, x_n)^\top + H(x) \in \Sigma(e_0)$,
where $A$ is a matrix of rank at most $m - 1$. We have

\[ \|F_0 - G_0\|_{\rm BW}^2 \,=\, \|b\|^2 + \|D^{-1/2}(A - B)\|^2. \]

The Eckart-Young Theorem implies that the distance from $D^{-1/2}B$ to the variety of matrices
of rank at most $m - 1$ equals $\sigma_m(D^{-1/2}B)$. From this, we infer the desired formula for the
distance to the variety $\Sigma(x)$:

\[ \mathrm{dist}_{\rm BW}(F, \Sigma(x)) \,=\, \sqrt{\|b\|^2 + \sigma_m(D^{-1/2}B)^2} \,=\, \sqrt{\|F(x)\|^2 + \sigma_m\bigl(D^{-1/2}\, JF(x)\, P_x\bigr)^2}. \]

Here, we use the fact that the singular values are invariant under multiplication with the
orthogonal matrix $U^\top$. □


Proposition 9.18 can be viewed as a condition number theorem. If $m = n$ and $a$ is a zero
of $F$, then the formula for the condition number in (9.10) is equal to the reciprocal distance
to $\Sigma(a)$, up to the scaling factor by the diagonal matrix $D$. The variety $\Sigma(a)$ plays the role
of a discriminant, but it still depends on the point $a$. The actual discriminant is defined as

\[ \Sigma \,:=\, \{F \in \mathcal{H}_{\mathbf{d}} \mid \text{there is } x \in \mathbb{P}^n_{\mathbb{R}} \text{ s.t. } F(x) = 0 \text{ and } \mathrm{rank}\, JF(x) < m\}. \tag{9.11} \]

This is the union of the local discriminants $\Sigma(a)$ where $a$ runs over $\mathbb{P}^n_{\mathbb{R}}$. Note that $\Sigma$ is the
real discriminant since we are asking for $x$ to be a real point. In the next section, we replace $\Sigma$
by its Zariski closure. This is the variety that is obtained by allowing $x$ to be complex.

The next theorem gives a formula for the distance from a polynomial system $F$ to
the discriminant $\Sigma$. This was proved by Raffalli [151] for the case $m = 1$. Bürgisser and
Cucker [35] cover the case $m = n$.


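Theorem 9.19 Let $m \le n$ and $\mathbf{d} = (d_1, \ldots, d_m)$ be a tuple of degrees. Let $F \in \mathcal{H}_{\mathbf{d}}$. Then,

\[ \mathrm{dist}_{\rm BW}(F, \Sigma) \,=\, \min_{x \in S^n} \sqrt{\|F(x)\|^2 + \sigma_m\bigl(D^{-1/2}\, JF(x)\, P_x\bigr)^2}, \]

where, as before, $D = \mathrm{diag}(d_1, \ldots, d_m)$ and $P_x := I_{n+1} - xx^\top$ is the projection onto $T_xS^n$.

Proof By definition, the discriminant equals $\Sigma = \bigcup_{x \in S^n} \Sigma(x)$. This implies

\[ \mathrm{dist}_{\rm BW}(F, \Sigma) \,=\, \min_{x \in S^n} \mathrm{dist}_{\rm BW}(F, \Sigma(x)). \]

The minimum is attained since $S^n$ is compact. Proposition 9.18 now yields the claim. □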


Remark 9.20 Theorem 9.19 can be generalized as follows. Fix $1 \le k \le m$. Consider the
distance from $F \in \mathcal{H}_{\mathbf{d}}$ to the space of polynomial systems $G \in \mathcal{H}_{\mathbf{d}}$ such that there exists
$x \in \mathbb{P}^n_{\mathbb{R}}$ with $G(x) = 0$ and $\mathrm{rank}\, JG(x) < k$. This distance equals

\[ \min_{x \in S^n} \sqrt{\|F(x)\|^2 + \sum_{i=k}^{m} \sigma_i\bigl(D^{-1/2}\, JF(x)\, P_x\bigr)^2}, \]

where $\sigma_k(\cdot), \ldots, \sigma_m(\cdot)$ are the $m - k + 1$ smallest singular values. This is because the
distance from a matrix $A \in \mathbb{R}^{m \times n}$ to the nearest matrix of rank at most $k - 1$ is equal to
$\sqrt{\sum_{i=k}^{m} \sigma_i(A)^2}$, by the Eckart-Young Theorem.


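Remark 9.21 Let $m = 1$. We have only one polynomial $f \in \mathcal{H}_d$, and Theorem 9.19 yields
$\mathrm{dist}_{\rm BW}(f, \Sigma) = \min_{x \in S^n} \sqrt{f(x)^2 + \frac{1}{d} \|P_x \nabla f(x)\|^2}$, where the column vector $\nabla f(x)$ is the
gradient of the polynomial $f$ at $x$. By Euler's formula for homogeneous functions, we have
$x^\top \cdot \nabla f(x) = d \cdot f(x)$, and hence $P_x \nabla f(x) = \nabla f(x) - (d \cdot f(x))\, x$. We obtain

\[ \mathrm{dist}_{\rm BW}(f, \Sigma) \,=\, \min_{x \in S^n} \sqrt{f(x)^2 + \frac{1}{d} \bigl\| \nabla f(x) - (d \cdot f(x))\, x \bigr\|^2}. \tag{9.12} \]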


Example 9.22 ($m = 1$, $n = 2$, $d = 3$) We compute the discriminant $\Sigma$ for ternary cubics:


R = QQ[x,y,z,c0,c1,c2,c3,c4,c5,c6,c7,c8,c9];
-- general ternary cubic; the source line is truncated after c5*x*y^2, so the
-- assignment of c6..c9 to the remaining four monomials is an assumption
f = c0*x^3+c1*y^3+c2*z^3+c3*x^2*y+c4*x^2*z+c5*x*y^2+
    c6*x*z^2+c7*y^2*z+c8*y*z^2+c9*x*y*z;
-- singular points of the cubic curve in the chart z = 1
I = ideal (diff(x,f), diff(y,f), diff(z,f), z-1);
disc = first first entries gens eliminate({x,y,z},I);
toString disc
(degree disc, # terms disc)

We see that $\Sigma$ is given by a polynomial with 2040 terms of degree 12 in 10 unknowns.

We now compute the distance from the Fermat cubic $f(x) = x_0^3 + x_1^3 + x_2^3$ to the
discriminant $\Sigma$. This is easy if we use (9.12). We have $\mathrm{dist}_{\rm BW}(f, \Sigma) = \min_{x \in S^2} \sqrt{h(x)}$,
where $h(x) = (x_0^3 + x_1^3 + x_2^3)^2 + 3 \sum_{i=0}^{2} \bigl(x_i^2 - (x_0^3 + x_1^3 + x_2^3)\, x_i\bigr)^2$. The polynomial function
$h : S^2 \to \mathbb{R}$ is minimized at the point $x_0 = \frac{1}{\sqrt{3}}(1, 1, 1)$, and we conclude that

\[ \mathrm{dist}_{\rm BW}(f, \Sigma) \,=\, \sqrt{h(x_0)} \,=\, \frac{1}{\sqrt{3}}. \]

Since $\Sigma$ is a cone in $\mathcal{H}_3$, we can compute the minimal angle (measured in the Bombieri-Weyl
metric) between $f$ and any polynomial in $\Sigma$ as in Example 9.11. Since the norm of
the Fermat cubic is $\|f\|_{\rm BW} = \sqrt{3}$, the minimal angle is $\arcsin(1/3) \approx 0.21635 \cdot \frac{\pi}{2}$. ⋄
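
The numbers in this example can be reproduced by evaluating the function $h$ from (9.12). The following is a minimal sketch; the function names are our own, and the code only evaluates $h$ at the minimizer stated above rather than searching for it.

```python
import math

def fermat(x):
    """The Fermat cubic f(x) = x_0^3 + x_1^3 + x_2^3."""
    return sum(t ** 3 for t in x)

def h(x):
    """h(x) = f(x)^2 + (1/3) * ||grad f(x) - 3 f(x) x||^2 for a unit vector x."""
    f = fermat(x)
    return f * f + 3.0 * sum((t * t - f * t) ** 2 for t in x)

x0 = [1.0 / math.sqrt(3.0)] * 3            # the stated minimizer on the sphere
dist = math.sqrt(h(x0))                     # distance to the discriminant
angle = math.asin(dist / math.sqrt(3.0))    # ||f||_BW = sqrt(3)

print(dist, angle, angle / (math.pi / 2.0))
```

The output shows the distance $1/\sqrt{3} \approx 0.57735$ and the angle $\arcsin(1/3) \approx 0.33984$, which is $0.21635$ times a right angle.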


9.4 Distance to the Discriminant 


Theorem 9.19 expresses the distance from a polynomial system $F$ to its discriminant $\Sigma$ as
the optimal value of an optimization problem over the unit sphere $S^n$. Thus, we are solving
the ED problem from Chapter 2 for discriminants. In this section, we examine this problem
through the lens of algebraic geometry. This uses the ED duality in Theorem 2.23 and the
fact that discriminants are projectively dual to toric varieties.

In the previous section, we considered dense systems, where the polynomials involve
all monomials of a fixed degree. In that setting, we used the orthogonal invariance of the
Bombieri-Weyl metric to prove Theorem 9.19. In many applications, however, polynomial
systems are not dense but sparse, and there is no such invariant metric. To address this, we
now work in the setting of $n$ sparse Laurent polynomials in $n$ variables. Our equations are not
homogeneous. The given data is a collection of finite support sets $A_1, A_2, \ldots, A_n \subset \mathbb{Z}^n$.
Our system of equations takes the form $f_1(x) = f_2(x) = \cdots = f_n(x) = 0$, where

\[ f_i(x) \,=\, \sum_{a \in A_i} c_{i,a}\, x^a \qquad \text{for } i = 1, 2, \ldots, n. \tag{9.13} \]

The BKK Theorem [19] tells us that the number of solutions in $(\mathbb{C}^*)^n$ equals the mixed
volume $\mathrm{MV}(P_1, P_2, \ldots, P_n)$. Here, $P_i$ is the convex hull of $A_i$ and the coefficients $c_{i,a}$ are
assumed to be generic. Thus, $f_i(x)$ is a polynomial with Newton polytope $P_i$. The BKK
theorem was discussed in Example 3.31. We assume that the mixed volume is at least 2.

The discriminant of the system (9.13) is the irreducible polynomial $\Delta(c)$ which vanishes
whenever our equations have a double root in $(\mathbb{C}^*)^n$. Here $c = (c_{i,a})$ denotes the vector
of all coefficients. The discriminant $\Delta$ is unique up to scaling. We refer to [42] for many
details about $\Delta$, including a more precise definition. Our goal here is to solve the Euclidean
distance problem for the discriminant hypersurface

\[ \Sigma \,=\, \{\, c \in \mathbb{R}^N \mid \Delta(c) = 0 \,\}, \tag{9.14} \]

where $N$ denotes the number of coefficients $c_{i,a}$.

Remark 9.23 The real discriminant equals the real locus in the discriminant $\Sigma$. This means
that the real hypersurface defined (for the dense case) in (9.11) agrees with the definition
above. Indeed, let $c$ be a generic real point in the hypersurface $\Sigma$. Then the corresponding
polynomial system $F = (f_1, \ldots, f_n)$ has a complex double zero $x$. That double zero $x$ is
unique. Since $F$ has real coefficients, $x$ is invariant under complex conjugation. Therefore $x$
has real coordinates, and $c$ is also in the set (9.11). In fact, there is a formula for expressing
$x$ by rational operations in $c$. See (9.15) and [71, Equation (1.28) in Section 12.1.B].


From now on, we work over the field $\mathbb{C}$ and we view the discriminant $\Sigma$ as a complex
projective hypersurface, defined by an irreducible homogeneous polynomial with integer
coefficients. This is consistent with [71] and with our earlier chapters.


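Example 9.24 We consider the case $n = 1$ and $A = \{0, 2, 5, 6\}$. The discriminant of the
polynomial $f(x) = c_1 + c_2 x^2 + c_3 x^5 + c_4 x^6$ equals

\[
\begin{aligned}
\Delta \,=\,\; & 46656\, c_1^4 c_4^5 + 32400\, c_1^3 c_2 c_3^2 c_4^3 - 3125\, c_1^3 c_3^6 + 13824\, c_1^2 c_2^3 c_4^4 - 1500\, c_1^2 c_2^2 c_3^4 c_4 \\
& + 192\, c_1 c_2^4 c_3^2 c_4^2 + 1024\, c_2^6 c_4^3 - 108\, c_2^5 c_3^4.
\end{aligned}
\]

This defines a surface $\Sigma$ of degree 9 in $\mathbb{P}^3$. For any point $c \in \Sigma$, we find the double root $x$
of $f$ by evaluating the gradient of the discriminant. This yields

\[ \Bigl( \frac{\partial \Delta}{\partial c_1} : \frac{\partial \Delta}{\partial c_2} : \frac{\partial \Delta}{\partial c_3} : \frac{\partial \Delta}{\partial c_4} \Bigr)(c) \,=\, (1 : x^2 : x^5 : x^6). \tag{9.15} \]

This identity follows from the duality between discriminants and toric varieties. In
particular, if $c$ is a real vector, then $x$ is a real number. ⋄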
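
The discriminant and the gradient identity (9.15) can be tested on a polynomial with a known double root. The following is a minimal sketch in exact integer arithmetic. The coefficient vector $c = (1, -2, 2, -1)$ is our own choice: the polynomial $f(x) = 1 - 2x^2 + 2x^5 - x^6$ satisfies $f(1) = f'(1) = 0$, so it has a double root at $x = 1$. The encoding of $\Delta$ as a list of (coefficient, exponent vector) pairs is an implementation choice.

```python
# Terms of the discriminant for A = {0, 2, 5, 6}: (coefficient, exponents of c1..c4).
TERMS = [
    (46656, (4, 0, 0, 5)),
    (32400, (3, 1, 2, 3)),
    (-3125, (3, 0, 6, 0)),
    (13824, (2, 3, 0, 4)),
    (-1500, (2, 2, 4, 1)),
    (192,   (1, 4, 2, 2)),
    (1024,  (0, 6, 0, 3)),
    (-108,  (0, 5, 4, 0)),
]

def monomial(c, exps):
    """Evaluate c1**e1 * c2**e2 * c3**e3 * c4**e4."""
    value = 1
    for ck, ek in zip(c, exps):
        value *= ck ** ek
    return value

def discriminant(c):
    return sum(coef * monomial(c, exps) for coef, exps in TERMS)

def gradient(c):
    """Partial derivatives of the discriminant with respect to c1, ..., c4."""
    grad = []
    for k in range(4):
        total = 0
        for coef, exps in TERMS:
            if exps[k] > 0:
                lowered = exps[:k] + (exps[k] - 1,) + exps[k + 1:]
                total += coef * exps[k] * monomial(c, lowered)
        grad.append(total)
    return grad

c = (1, -2, 2, -1)       # f has a double root at x = 1
print(discriminant(c))   # vanishes on the discriminant
print(gradient(c))       # proportional to (1 : x^2 : x^5 : x^6) at x = 1
```

The discriminant evaluates to zero, and all four partial derivatives coincide, which matches the point $(1 : 1 : 1 : 1)$ predicted by (9.15) for $x = 1$.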


Given any sparse system $F$, we seek the distance from its coefficient vector $u$ to the
discriminant $\Sigma$. We will use the formulation of the ED problem given in Theorem 2.23.
Let $X$ be the toric variety naturally associated with the tuple $(A_1, A_2, \ldots, A_n)$. Then, the
dual variety $X^\vee$ is precisely the discriminant hypersurface $\Sigma$. This observation is known as
the Cayley trick; see [71] and [42, Section 2] for expositions.

We write $x$ for a point in $X$ and $c$ for a point in $\Sigma$. The conormal variety $N_X$ consists
of pairs $(x, c)$ such that the hyperplane $c$ is tangent to $X$ at the point $x$. Equivalently, $c$ is a
polynomial system whose variety is singular at $x$. Given any system $u$, our task is to solve
the equations $x + c = u$ for $(x, c) = (x, u - x) \in N_X$. The desired distance from $u$ to $\Sigma$ is
the minimum of $\|x\|$ over all solutions to these equations. In symbols, we have

\[ \mathrm{dist}(u, \Sigma) \,=\, \min_{c \in \Sigma} \|u - c\| \,=\, \min_{(x, u - x) \in N_X} \|x\|. \tag{9.16} \]

Here, the minimum is taken over all real points. Of course, the minimum value depends on
the metric that we choose. In the previous section, we took the Bombieri-Weyl metric.


Example 9.25 ($n = 1$, $d = 3$) Fix a polynomial $u = u_0 + u_1 t + u_2 t^2 + u_3 t^3$ with three distinct
complex roots. The coefficients $u_0, u_1, u_2, u_3$ are fixed numbers. Here $A = \{0, 1, 2, 3\}$, so
$X = X_A$ is the twisted cubic curve in $\mathbb{P}^3$. The discriminant $X^\vee = \Sigma$ is given by

\[ \Delta \,=\, c_1^2 c_2^2 - 4 c_0 c_2^3 - 4 c_1^3 c_3 + 18 c_0 c_1 c_2 c_3 - 27 c_0^2 c_3^2, \]

as computed in Example 4.13. We consider the following problem:

\[ \text{minimize } \|u - c\|^2 = (u_0 - c_0)^2 + (u_1 - c_1)^2 + (u_2 - c_2)^2 + (u_3 - c_3)^2 \ \text{ subject to } c \in \Sigma. \]

The conormal variety $N_X$ is a 4-dimensional affine variety in $\mathbb{C}^8$. Its prime ideal equals

\[
\begin{aligned}
\langle\, & x_1^2 - x_0 x_2, \ x_1 x_2 - x_0 x_3, \ x_2^2 - x_1 x_3, \ \Delta, \\
& c_0 x_0 - c_2 x_2 - 2 c_3 x_3, \ c_1 x_1 + 2 c_2 x_2 + 3 c_3 x_3, \ 3 c_0 x_1 + 2 c_1 x_2 + c_2 x_3, \ c_1 x_0 + 2 c_2 x_1 + 3 c_3 x_2 \,\rangle.
\end{aligned}
\]

The equation $x + c = u$ has 7 complex solutions in $N_X$, since $X$ has ED degree 7. We
seek the one that minimizes $\|x\|^2$. For a numerical example, consider the polynomial
$3t^3 - 26t^2 + 61t - 30$, which is represented by the vector $u = (-30, 61, -26, 3)$. We solve
the equations $x + c = u$ and $(x, c) \in N_X$. Among the seven complex solutions, three are
real. The optimal value is $\|x\|^2 = 0.0251866$, and this is attained by the point

\[ c \,=\, u - x \,=\, (-29.997,\ 61.011,\ -25.959,\ 3.153). \]

In larger cases, where computing the ideal of $N_X$ is infeasible, we can still solve the ED
problem with the monomial parametrization of $X$. In our example, the problem is:

\[ \text{minimize } \sum_{i=0}^{3} (s\, t^i)^2 \ \text{ over all critical points of } \ (s, t) \mapsto \sum_{i=0}^{3} (u_i - s\, t^i)^2. \tag{9.17} \]

The optimal point is $(s, t) = (-0.002824, 3.78442)$, and the optimal value is $0.0251866$. ⋄
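
The optimal point reported above can be recomputed with elementary tools. The following is a minimal sketch, not the method used in the text. It rests on the observation that, for fixed $t$, the objective in (9.17) is minimized by $s = u(t)/p(t)$ with $p(t) = 1 + t^2 + t^4 + t^6$, so that critical points with $s \neq 0$ correspond to roots of $2u'(t)p(t) - u(t)p'(t)$. Points with $s = 0$ are excluded here, since they give $x = 0$. The bracketing interval $[3.7, 3.9]$ and all function names are our own choices.

```python
u = [-30.0, 61.0, -26.0, 3.0]   # coefficients u_0..u_3 from the example

def U(t):
    return sum(ui * t**i for i, ui in enumerate(u))

def dU(t):
    return sum(i * ui * t**(i - 1) for i, ui in enumerate(u) if i > 0)

def p(t):
    return sum(t**(2 * i) for i in range(len(u)))

def dp(t):
    return sum(2 * i * t**(2 * i - 1) for i in range(1, len(u)))

def g(t):
    """Numerator of the derivative of U(t)^2 / p(t), divided by U(t)."""
    return 2.0 * dU(t) * p(t) - U(t) * dp(t)

def bisect(func, lo, hi, steps=200):
    """Plain bisection; assumes func changes sign on [lo, hi]."""
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if func(lo) * func(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def cubic_discriminant(c):
    c0, c1, c2, c3 = c
    return (c1**2 * c2**2 - 4 * c0 * c2**3 - 4 * c1**3 * c3
            + 18 * c0 * c1 * c2 * c3 - 27 * c0**2 * c3**2)

t = bisect(g, 3.7, 3.9)
s = U(t) / p(t)
x = [s * t**i for i in range(len(u))]        # point on the cone over the twisted cubic
value = sum(xi * xi for xi in x)             # squared distance ||x||^2
c = [ui - xi for ui, xi in zip(u, x)]        # nearby polynomial with a double root

print(s, t, value)
print(c, cubic_discriminant(c))
```

The computed values agree with $(s, t) = (-0.002824, 3.78442)$ and $\|x\|^2 = 0.0251866$ to the displayed precision, and the discriminant of $c = u - x$ vanishes up to rounding.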


We now turn to the general case $n \ge 1$. The toric variety $X$ has dimension $2n - 1$,
and it is associated with the Cayley configuration of $A_1, A_2, \ldots, A_n$. To define this, we
introduce new variables $y_1, y_2, \ldots, y_n$. We encode our polynomial system $F = (f_1, \ldots, f_n)$
into the single polynomial $\psi(x, y) = \sum_{i=1}^{n} \sum_{a \in A_i} c_{i,a}\, y_i\, x^a$. The toric variety $X = X_A$ is
parametrized by all monomials $y_i x^a$, where $i = 1, \ldots, n$ and $a \in A_i$.

Consider the hypersurface $Q := \{(x, y) \in (\mathbb{C}^*)^{2n} \mid \psi(x, y) = 0\}$. The discriminant $\Sigma$ is
a hypersurface in the space of coefficient vectors $c = (c_{i,a})$. It comprises all $c$ such that $Q$
has a singular point. Therefore, the toric variety $X$ is dual to the discriminant $\Sigma$. Points $u$ in
the common ambient space of $X$ and $\Sigma$ are identified with polynomial systems (9.13) that
have concrete numerical coefficients. The distance from such a system to the discriminant
$\Sigma$ can be computed as described in (9.16). We summarize our discussion as follows:


Theorem 9.26 The Euclidean distance from a polynomial system $u$ to the discriminant $\Sigma$
equals the smallest norm $\|x\|$ among all points $x$ in the toric variety $X$ that are critical for
the distance to $u$.


We emphasize again that the distance crucially depends on the norm that we choose.
This was Bombieri-Weyl in Theorem 9.19. The points on $X$ that are critical for the distance
to $u$ are best computed with the monomial parametrization. For $n = 1$, we saw this in (9.17).


Example 9.27 (Hyperdeterminant) Fix $n = 2$ and $A_1 = A_2 = \{(0, 0), (0, 1), (1, 0), (1, 1)\}$.
The Cayley configuration is the 3-cube, and $X$ is a 4-dimensional affine toric variety in $\mathbb{C}^8$,
namely the cone over the Segre variety $\mathbb{P}^1 \times \mathbb{P}^1 \times \mathbb{P}^1 \subset \mathbb{P}^7$. The points $c$ in $\mathbb{C}^8$ correspond to
pairs of bilinear equations in two unknowns:

\[ c_{11} + c_{12} t_1 + c_{13} t_2 + c_{14} t_1 t_2 \,=\, c_{21} + c_{22} t_1 + c_{23} t_2 + c_{24} t_1 t_2 \,=\, 0. \tag{9.18} \]

For generic $c$, this polynomial system has 2 solutions, and the two solutions coincide when
the discriminant $\Delta$ vanishes. The discriminant for $(A_1, A_2)$ is the $2 \times 2 \times 2$ hyperdeterminant
computed in Example 3.16:

\[
\begin{aligned}
\Delta \,=\,\; & c_{11}^2 c_{24}^2 + c_{12}^2 c_{23}^2 + c_{13}^2 c_{22}^2 + c_{14}^2 c_{21}^2 + 4 c_{11} c_{14} c_{22} c_{23} + 4 c_{12} c_{13} c_{21} c_{24} - 2 c_{11} c_{12} c_{23} c_{24} \\
& - 2 c_{11} c_{13} c_{22} c_{24} - 2 c_{11} c_{14} c_{21} c_{24} - 2 c_{12} c_{13} c_{22} c_{23} - 2 c_{12} c_{14} c_{21} c_{23} - 2 c_{13} c_{14} c_{21} c_{22}.
\end{aligned}
\]

Given any polynomial system $u$ of the form (9.18), we seek its squared distance to the
hyperdeterminantal hypersurface $\Sigma = \{\Delta = 0\}$ in $\mathbb{R}^8$. This number is the optimal value of:

\[
\begin{aligned}
& \text{minimize } \sum_{i=1}^{2} y_i^2\, (1 + t_1^2 + t_2^2 + t_1^2 t_2^2) \ \text{ over all critical points of the function} \\
& (y_1, y_2, t_1, t_2) \,\mapsto\, \sum_{i=1}^{2} \bigl( (u_{i1} - y_i)^2 + (u_{i2} - y_i t_1)^2 + (u_{i3} - y_i t_2)^2 + (u_{i4} - y_i t_1 t_2)^2 \bigr).
\end{aligned}
\]

This objective function has 6 critical points over $\mathbb{C}$. The ED degree is 6, by [60, Example 8.2].
The ED degree jumps to 34 if the Euclidean norm is replaced by any nearby generic
quadratic form. Indeed, the generic ED degree of the hyperdeterminant $\Delta$ equals 34. This
can be seen by summing the polar degrees in the column labeled $k = 3$ in Table 5.2. ⋄
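
The hyperdeterminant can be evaluated directly from the formula in Example 9.27. The following is a minimal sketch; the two test systems are our own choices. In the first, $t_1 - t_2 = 0$ and $1 - 2t_1 + t_1 t_2 = 0$ reduce to $(t_1 - 1)^2 = 0$, so the two solutions coincide. In the second, $t_1 - t_2 = 0$ and $t_1 t_2 - 1 = 0$ have the two distinct solutions $(1, 1)$ and $(-1, -1)$.

```python
def hyperdeterminant(c1, c2):
    """The 2x2x2 hyperdeterminant of the system (9.18) with rows c1 and c2."""
    c11, c12, c13, c14 = c1
    c21, c22, c23, c24 = c2
    return (c11**2 * c24**2 + c12**2 * c23**2 + c13**2 * c22**2 + c14**2 * c21**2
            + 4 * c11 * c14 * c22 * c23 + 4 * c12 * c13 * c21 * c24
            - 2 * c11 * c12 * c23 * c24 - 2 * c11 * c13 * c22 * c24
            - 2 * c11 * c14 * c21 * c24 - 2 * c12 * c13 * c22 * c23
            - 2 * c12 * c14 * c21 * c23 - 2 * c13 * c14 * c21 * c22)

first_equation = (0, 1, -1, 0)        # t1 - t2
double_solution = (1, -2, 0, 1)       # 1 - 2 t1 + t1 t2
distinct_solutions = (-1, 0, 0, 1)    # t1 t2 - 1

print(hyperdeterminant(first_equation, double_solution))
print(hyperdeterminant(first_equation, distinct_solutions))
```

The first value is zero, as expected for a system on the discriminant, and the second is nonzero.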
Chapter 10
Machine Learning

One of the principal goals of machine learning is to learn in an automated way functions
that represent the relationship between data points. Suppose we are given the data set
$\mathcal{D} = \{(x_1, y_1), \ldots, (x_d, y_d)\} \subset \mathbb{R}^n \times \mathbb{R}^m$. The vectors $x_i$ are the input data. The vectors $y_i$
are the output data. For instance, in image classification, the $x_i$ might encode images of
certain objects and the $y_i$ are the respective classifiers. Another popular example from
generative AI is the scenario where $x_i$ and $y_i$ encode word tokens that co-occur in text data.

The goal is to find a function $f : \mathbb{R}^n \to \mathbb{R}^m$ such that $f(x_i) \approx y_i$. The meaning of $\approx$ rests
on a loss function $\ell(y', y)$. We seek to minimize the mean loss $\ell_{\mathcal{D}}(f) := \frac{1}{d} \sum_{i=1}^{d} \ell(f(x_i), y_i)$
over a class of functions $f$. Often, the loss function comes from a metric. For instance, the
squared-error loss is the squared Euclidean distance $\ell(y', y) = \|y' - y\|^2$ in output space $\mathbb{R}^m$,
but also the Wasserstein distance from Chapter 5 and Kullback-Leibler divergence from
Chapter 11 are used. One usually restricts to a model, consisting of functions that are
specified by $N$ real parameters $\theta = (\theta_1, \ldots, \theta_N) \in \mathbb{R}^N$. To learn a function means to
compute $\theta$ that minimizes the loss for the data $\mathcal{D}$.

Problems of the type described above fall under the header supervised learning. Here,
every input data point $x_i$ has an associated output $y_i$, also called label, and the goal is to
find a relationship between them. By contrast, in unsupervised learning, we are given data
points $D = \{x_1, \ldots, x_d\} \subset \mathbb{R}^n$ without labels. The goal in unsupervised learning is to
describe the data with as few parameters as possible. For instance, assuming the variety
hypothesis, our goal could be to learn a variety $X \subset \mathbb{R}^n$ that represents the data $D$. Here, the
parameters define the polynomials that cut out $X$. We discuss this approach in Section 10.3.

The next sections offer a glimpse of machine learning from a geometric perspective. For a
systematic introduction, we refer to the book Mathematical Aspects of Deep Learning [75].


10.1 Neural Networks 


The model we focus on is the (feedforward) neural network. Such a network is a composition


$$f_\theta \;:=\; f_{L,\theta} \circ \cdots \circ f_{2,\theta} \circ f_{1,\theta}.$$

Here, $f_{i,\theta}$ is a function that depends on a parameter $\theta$. The function $f_\theta$ is called the
end-to-end function. We call $L$ the number of layers. The dimension of the domain of $f_{i,\theta}$ is
the width of the $i$-th layer. The number of layers, their widths, and the type of each layer
function $f_{i,\theta}$ constitute the network's architecture. The variables of the $i$-th layer $f_{i,\theta}$ are
neurons or nodes. A neural network is often visualized by a graph as in Figure 10.1.


Fig. 10.1: A fully connected neural network with two layers fi,9 : Rt — R? and fo,9 : R? > R°. 


We say that the network is fully connected if, for every index $i = 1, \ldots, L$, every
component of the $i$-th layer $f_{i,\theta}$ depends on every variable/neuron/node. Figure 10.1 shows
a fully connected network. On the other hand, Figure 10.2 shows a network that is not fully
connected. Some of the edges are missing.


Fig. 10.2: A neural network with two layers that is not fully connected.


Most commonly, the layer functions are compositions of an affine linear map $\alpha_{i,\theta}$ with
a (typically nonlinear) map $\sigma_i : \mathbb{R} \to \mathbb{R}$ that is applied componentwise:


$$f_{i,\theta} \;=\; (\sigma_i, \ldots, \sigma_i) \circ \alpha_{i,\theta}. \qquad (10.1)$$

The map $\sigma_i$ is called the activation function. One example of an activation function is
the Rectified Linear Unit (ReLU), which is $\sigma(z) = \max\{0, z\}$. Methods from (metric or
tropical) algebraic geometry can be used to study neural networks when the activation
function has an algebraic structure (identity, polynomial, ReLU, etc.).
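To make the layer structure (10.1) concrete, here is a minimal NumPy sketch of a two-layer feedforward network. The widths `[4, 3, 2]`, the random parameters, and the helper names `layer` and `end_to_end` are hypothetical choices for illustration only.

```python
import numpy as np

def relu(z):
    """Rectified Linear Unit, applied componentwise."""
    return np.maximum(0.0, z)

def identity(z):
    return z

def layer(W, b, sigma):
    """Layer function x -> sigma(W x + b): an affine map followed by an activation."""
    return lambda x: sigma(W @ x + b)

def end_to_end(layers):
    """Compose the layer functions f_1, ..., f_L into f_L o ... o f_1."""
    def f(x):
        for f_i in layers:
            x = f_i(x)
        return x
    return f

rng = np.random.default_rng(0)
widths = [4, 3, 2]  # hypothetical input, hidden, and output dimensions
params = [
    (rng.standard_normal((widths[i + 1], widths[i])), rng.standard_normal(widths[i + 1]))
    for i in range(len(widths) - 1)
]

relu_net = end_to_end([layer(W, b, relu) for W, b in params])
linear_net = end_to_end([layer(W, b, identity) for W, b in params])
x = rng.standard_normal(widths[0])
```

With the identity activation, the end-to-end function is again affine, which is the case studied in the remainder of this section.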

Here, we consider the simplest case where the activation function is the identity. The 
theoretical study of neural networks is much more challenging if that function is nonlinear. 
For instance, the end-to-end function of a ReLU neural network (i.e., all activation functions 
are ReLU) is piecewise linear. In fact, every piecewise linear function with finitely many 
pieces arises from a fully connected ReLU network [8]. ReLU end-to-end functions can 
be interpreted as tropical rational functions [180]. That perspective was further developed 
in [134] to provide sharp bounds on the number of linear regions of the end-to-end functions. 
Another geometric perspective on ReLU networks is described in [74]. A first algebro- 
geometric study for networks with polynomial activation functions appears in [105]. 

The term expressivity refers to which functions neural networks with fixed architecture 
can express. Thus, expressivity is the study of the space M of all functions that are given 
by our network. This function space M is often called the neuromanifold of the neural 
network architecture. Note that M is typically not a smooth manifold. The neuromanifold 
is the image of the network’s parametrization map 


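$$\mu : \mathbb{R}^N \to \mathcal{M}, \quad \theta \mapsto f_{L,\theta} \circ \cdots \circ f_{2,\theta} \circ f_{1,\theta}. \qquad (10.2)$$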


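The loss function $\ell_D$ is defined on $\mathcal{M}$. We can pull it back to the parameter space via $\mu$,
i.e., we consider the composition


$$\mathcal{L} : \mathbb{R}^N \xrightarrow{\;\mu\;} \mathcal{M} \xrightarrow{\;\ell_D\;} \mathbb{R}. \qquad (10.3)$$


Instead of minimizing the loss function $\ell_D$ over the neuromanifold $\mathcal{M}$, training a neural
network means minimizing $\mathcal{L}$ over the parameter space $\mathbb{R}^N$ of the neuromanifold.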


Remark 10.1 The numerical uncertainty in minimizing the loss depends on the condition
number of the function $\mathcal{L}$. Following Chapter 9, it would be interesting to compare the
condition numbers of $\mathcal{L}$ and $\ell_D$ and how they are related to the network's architecture.
There is lots of room for metric algebraic geometry to contribute.


One of the big mysteries in machine learning theory is why training neural networks
results in "nice" minima. The concrete meaning of the adjective nice varies in the literature.
See [130] for an algebro-geometric perspective on "flat" minima. Training a neural network
minimizes the loss function $\mathcal{L} = \ell_D \circ \mu$ in (10.3) on the parameter space $\mathbb{R}^N$. The meaningful
critical points are those that come from critical points of $\ell_D$ on the neuromanifold
$\mathcal{M}$. Formally, such pure critical points $\theta$ of $\mathcal{L}$ satisfy that $\mu(\theta)$ is a critical point of the
functional $\ell_D$ restricted to the smooth locus of $\mathcal{M}$. The network parametrization map $\mu$
can induce additional spurious critical points of the function $\mathcal{L}$.

The following two questions play a central role in the study of learning problems: 


1. How does the network architecture affect the geometry of the neuromanifold M? 
2. How does the geometry of the neuromanifold impact the training of the network?

We shall discuss these questions for two specific network architectures. In the remainder
of this section, we start with the simplest class, namely networks that are linear and fully
connected. Thereafter, in Section 10.2, we turn to linear convolutional networks. This will
lead us to interesting algebraic varieties.

In a linear fully connected neural network, the activation function $\sigma_i$ in layer $i$ is
the identity and the layer functions $f_{i,\theta}$ are arbitrary linear maps, for $i = 1, \ldots, L$. The
parameters $\theta$ are the entries of the matrices representing the linear layer functions. The
network parametrization map (10.2) specializes to


$$\begin{aligned}
\mu : \mathbb{R}^{k_1 \times k_0} \times \mathbb{R}^{k_2 \times k_1} \times \cdots \times \mathbb{R}^{k_L \times k_{L-1}} &\to \mathbb{R}^{k_L \times k_0}, \\
(W_1, W_2, \ldots, W_L) &\mapsto W_L \cdots W_2 W_1.
\end{aligned} \qquad (10.4)$$


Let us first discuss the expressivity of this model. The image of the parametrization map $\mu$
consists of all matrices with rank at most $\min(k_0, k_1, \ldots, k_L)$. Hence, the neuromanifold


$$\mathcal{M} \;=\; \{ W \in \mathbb{R}^{k_L \times k_0} \mid \mathrm{rank}(W) \leq \min(k_0, k_1, \ldots, k_L) \}$$


is a determinantal variety. If the input or output dimension is one of the minimal widths, then
$\mathcal{M}$ is equal to the whole ambient space. Equivalently, in symbols, if $\min(k_0, k_1, \ldots, k_L) =
\min(k_0, k_L)$, then $\mathcal{M} = \mathbb{R}^{k_L \times k_0}$. Otherwise, $\mathcal{M}$ is a lower-dimensional Zariski closed subset.
The singular locus of $\mathcal{M}$ is the variety of matrices of rank at most $\min(k_0, k_1, \ldots, k_L) - 1$,
and therefore, it is the neuromanifold for a smaller network architecture.
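The rank bound can be observed numerically. The sketch below multiplies random weight matrices for a hypothetical architecture with widths $(k_0, k_1, k_2, k_3) = (5, 2, 4, 3)$ and computes the rank of the end-to-end matrix; the widths and the helper name `end_to_end_matrix` are assumptions made for this illustration.

```python
import numpy as np

def end_to_end_matrix(weights):
    """Return the product W_L ... W_2 W_1 for the list [W_1, ..., W_L], as in (10.4)."""
    W = weights[0]
    for W_i in weights[1:]:
        W = W_i @ W
    return W

rng = np.random.default_rng(1)
widths = [5, 2, 4, 3]  # hypothetical widths k_0, k_1, k_2, k_3
weights = [
    rng.standard_normal((widths[i + 1], widths[i])) for i in range(len(widths) - 1)
]

W = end_to_end_matrix(weights)
rank = np.linalg.matrix_rank(W)  # for generic weights this equals min(widths)
```

Here the bottleneck $k_1 = 2$ is smaller than both the input and the output dimension, so $\mathcal{M}$ is the proper determinantal variety of $3 \times 5$ matrices of rank at most 2.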

Next, we turn to questions about optimization. Theoretical studies on the optimization 
problem of training neural networks can be roughly grouped into static and dynamic studies. 
Static investigations concern the loss landscape [124] and the critical points of (10.3), while 
dynamic studies focus on a training algorithm, e.g., by investigating its convergence. 

We first discuss static properties. For linear fully connected networks, the pure and
spurious critical points were characterized in [33]. The critical points $\theta$ of $\mathcal{L}$ such that $\mu(\theta)$
has maximal rank are pure [33, Proposition 6]. All spurious critical points $\theta$ correspond to
lower-rank matrices $\mu(\theta)$. These are essentially always saddles [33, Proposition 9]. If $\ell_D$ is
smooth and convex, then all non-global local minima of $\mathcal{L}$ (often called "bad" minima) are
pure critical points. It is a common, but false, belief that linear fully connected networks
do not have bad minima. Our next result follows from [33, Proposition 10]:


Theorem 10.2 Consider a linear fully connected network and a smooth and convex function
$\ell_D$. The function $\mathcal{L} = \ell_D \circ \mu$ has non-global local minima if and only if $\ell_D|_{\mathrm{Reg}(\mathcal{M})}$
has non-global local minima.


This result immediately implies that linear fully connected networks in two special
settings do not have bad minima. First, if $\mathcal{M}$ equals the ambient space (i.e., the input or
output dimension of the network is its minimal width), then any convex function $\ell_D$ has
exactly one minimum on $\mathcal{M}$, and so $\mathcal{L}$ does not have any bad minima. This is the main
result from [119]. The second setting concerns the squared-error loss


$$\ell_D : \mathbb{R}^{k_L \times k_0} \to \mathbb{R}, \quad W \mapsto \frac{1}{d} \sum_{i=1}^{d} \| W x_i - y_i \|^2, \qquad (10.5)$$


for the training data $D = \{(x_1, y_1), \ldots, (x_d, y_d)\} \subset \mathbb{R}^{k_0} \times \mathbb{R}^{k_L}$. Writing $X \in \mathbb{R}^{k_0 \times d}$ and
$Y \in \mathbb{R}^{k_L \times d}$ for the data matrices whose columns are $x_i$ and $y_i$, the squared-error loss
becomes the squared Frobenius norm


$$\ell_D(W) \;=\; \frac{1}{d} \| W X - Y \|_F^2. \qquad (10.6)$$


If $XX^\top$ has full rank, then we record the data in the matrix $U = Y X^\top \big( (XX^\top)^{1/2} \big)^{-1}$.
Minimizing (10.6) is equivalent to minimizing the squared Euclidean distance $\|W - U\|^2$
over all $W \in \mathrm{Reg}(\mathcal{M})$; see [33, Section 3.3]. According to the Eckart-Young Theorem, this
optimization problem has a unique local and global minimum, provided the singular values
of $U$ are distinct and positive. Hence, given many generic data pairs $(x_i, y_i)$, the squared-error
loss $\ell_D$ has no non-global minima on $\mathrm{Reg}(\mathcal{M})$. Theorem 10.2 now implies that the
squared-error loss $\mathcal{L}$ on the parameter space has no bad minima. This is a prominent result
in machine learning, often attributed to [11] or [103]. However, the two settings where
either $\mathcal{M}$ is a vector space or we consider the squared-error loss are rather special. Non-global
local minima are expected for other loss functions and architectures where $\mathcal{M}$ is a
proper determinantal variety; see [33, Example 13].
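The reduction to the Eckart-Young Theorem can be sketched in a few lines of NumPy. The code below assumes that the equivalence is realized by the linear change of coordinates $W \mapsto W (XX^\top)^{1/2}$, which preserves rank; this reading, the function names, and the random test data are assumptions for illustration rather than the construction of [33].

```python
import numpy as np

def best_rank_r_fit(X, Y, r):
    """Minimize ||W X - Y||_F over matrices W of rank <= r, assuming X X^T is invertible.

    Since ||W X - Y||^2 = ||W P - U||^2 + const with P = (X X^T)^{1/2} and
    U = Y X^T P^{-1}, we truncate the SVD of U and undo the change of coordinates.
    """
    evals, evecs = np.linalg.eigh(X @ X.T)
    P_inv = evecs @ np.diag(1.0 / np.sqrt(evals)) @ evecs.T  # (X X^T)^{-1/2}
    U = Y @ X.T @ P_inv
    A, s, Bt = np.linalg.svd(U, full_matrices=False)
    U_r = (A[:, :r] * s[:r]) @ Bt[:r, :]  # best rank-r approximation of U
    return U_r @ P_inv

def loss(W, X, Y):
    """Squared-error loss (10.6)."""
    return np.linalg.norm(W @ X - Y) ** 2 / X.shape[1]

rng = np.random.default_rng(2)
X = rng.standard_normal((4, 20))  # hypothetical inputs: k_0 = 4, d = 20
Y = rng.standard_normal((3, 20))  # hypothetical outputs: k_L = 3
W_star = best_rank_r_fit(X, Y, r=2)
```

Perturbing the factors of `W_star` while keeping rank at most 2 should never decrease the loss, which gives a quick numerical check of global optimality on the determinantal variety.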

Now, we turn to dynamical properties. Optimization algorithms used in the training of
neural networks are variations of gradient descent. After picking initial parameters $\theta$, gradient
descent adapts the parameters successively with the goal of minimizing the loss $\mathcal{L}(\theta)$.
When training linear fully connected networks with the squared-error loss, where the data
matrix $XX^\top$ has full rank, gradient descent converges for almost all initializations (under
reasonable assumptions on its step sizes) to a critical point $\theta$ of the loss $\mathcal{L}$ [136, Theorem 2.4].
Moreover, the matrix $\mu(\theta) \in \mathcal{M}$ is a global minimum of $\ell_D$ restricted to the
smooth manifold of all matrices of the same format and rank [136, Theorem 2.6]. The
authors of [136] conjecture that the matrix $\mu(\theta)$ has maximal possible rank, so it is a
smooth point on the determinantal variety $\mathcal{M}$.

The algebraic invariants of gradient flow are an essential ingredient in the convergence
analysis of [136]. The curve in parameter space traced by gradient flow is typically
transcendental, but it does satisfy some algebraic relations. In other words, its Zariski closure
is not the whole ambient parameter space.


Proposition 10.3 ([9]) Consider a linear fully connected network with parametrization
(10.4). Let $\theta(t) = (W_1(t), W_2(t), \ldots, W_L(t))$ be the curve traced out by gradient flow,
starting at $t = 0$. Then, the $L - 1$ matrices


$$W_i^\top(t)\, W_i(t) \;-\; W_{i-1}(t)\, W_{i-1}^\top(t), \quad \text{where } i = 2, 3, \ldots, L,$$
remain constant for all times $t > 0$.


We discuss some practical consequences of this result. One calls a parameter tuple
$\theta = (W_1, \ldots, W_L)$ balanced if $W_i^\top W_i = W_{i-1} W_{i-1}^\top$ for $i \in \{2, \ldots, L\}$. Note that all
matrices in a balanced tuple have the same Frobenius norm: $\|W_1\| = \|W_2\| = \cdots = \|W_L\|$.
If a linear network is initialized at a balanced tuple $\theta(0)$, then every tuple $\theta(t)$ along the
gradient flow curve is balanced, by Proposition 10.3. Since all matrices in $\theta(t)$ have the
same Frobenius norm, it cannot happen that one matrix converges to zero while another
matrix has entries that diverge to infinity. In fact, if one matrix converges to zero, then
so does the whole tuple $\theta(t)$. Algebraic invariants of gradient flow were also studied
in [108, Proposition 5.13] for linear convolutional networks (cf. Section 10.2). For ReLU
networks and other networks, see [177, Lemma 3] and [62, Theorems 2.1-2.2].
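The invariants in Proposition 10.3 can be observed numerically. The following sketch is an illustration under stated assumptions: it uses discrete gradient descent with a small step size as a stand-in for gradient flow, the squared-error loss (10.6), and a hypothetical architecture with widths $(3, 4, 2, 3)$. For discrete steps the invariants are only conserved up to second order in the step size, so a small drift is expected.

```python
import numpy as np

def chain(mats, size):
    """Product M_n ... M_1 of the list [M_1, ..., M_n]; identity of the given size if empty."""
    out = np.eye(size)
    for M in mats:
        out = M @ out
    return out

def gradients(weights, X, Y):
    """Gradients of (1/d) ||W_L ... W_1 X - Y||_F^2 with respect to each W_i."""
    d = X.shape[1]
    W = chain(weights, X.shape[0])
    G = 2.0 / d * (W @ X - Y) @ X.T  # gradient with respect to the end-to-end matrix
    grads = []
    for i in range(len(weights)):
        before = chain(weights[:i], X.shape[0])                # W_{i-1} ... W_1
        after = chain(weights[i + 1:], weights[i].shape[0])    # W_L ... W_{i+1}
        grads.append(after.T @ G @ before.T)
    return grads

def invariants(weights):
    """The L - 1 matrices W_i^T W_i - W_{i-1} W_{i-1}^T for i = 2, ..., L."""
    return [
        weights[i].T @ weights[i] - weights[i - 1] @ weights[i - 1].T
        for i in range(1, len(weights))
    ]

rng = np.random.default_rng(0)
widths = [3, 4, 2, 3]  # hypothetical widths k_0, ..., k_3
weights = [0.3 * rng.standard_normal((widths[i + 1], widths[i])) for i in range(3)]
X = rng.standard_normal((3, 10))
Y = rng.standard_normal((3, 10))

start = invariants(weights)
step = 1e-4
for _ in range(500):
    grads = gradients(weights, X, Y)
    weights = [W - step * g for W, g in zip(weights, grads)]
drift = max(np.abs(a - b).max() for a, b in zip(invariants(weights), start))
```

The conservation law rests on the identity $W_i^\top \nabla_{W_i} = \nabla_{W_{i-1}} W_{i-1}^\top$ between consecutive gradients, which holds exactly at every parameter tuple and can be checked directly from the output of `gradients`.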


Remark 10.4 (Nonlinear Autoencoders) Invariants also play a crucial role in the study of
attractors of autoencoders. An autoencoder is a composition of two neural networks: an
encoder and a decoder network, such that the input dimension of the encoder equals the
output dimension of the decoder. Given training data $x_1, \ldots, x_d$, the composed network is
typically trained by minimizing the autoencoding loss


$$\ell \;=\; \sum_{i=1}^{d} \| f(x_i) - x_i \|^2.$$


Note that $x_i$ serves both as input and as output data. It is shown in [150] that an
autoencoder trained with gradient descent on a single training example $x$ (i.e., $d = 1$)
memorizes $x$ as an attractor (under suitable assumptions on the activation function and
initialization). Namely, $x$ is a fixed point of the learned function $f$ such that the sequence
$(f^j(y))_{j \in \mathbb{N}}$ converges to $x$ for any $y$ in an open neighborhood of $x$.


10.2 Convolutional Networks 


A linear neural network parametrizes matrices that admit the structured product
decomposition (10.4). When each layer is a one-dimensional convolution, the matrices $W_i$ are
generalized Toeplitz matrices. An example of such a matrix is


$$\begin{bmatrix}
w_0 & w_1 & w_2 & 0 & 0 & 0 & 0 \\
0 & 0 & w_0 & w_1 & w_2 & 0 & 0 \\
0 & 0 & 0 & 0 & w_0 & w_1 & w_2
\end{bmatrix}. \qquad (10.7)$$


A convolution on one-dimensional signals is a linear map that depends on a filter
$w = (w_0, \ldots, w_{k-1}) \in \mathbb{R}^k$ and a stride $s \in \mathbb{N}$. It computes the inner product of the filter $w$
with parts of a given input vector $x$, and traverses the whole vector $x$ by moving $w$ through
it with stride $s$. The formula for this linear map is


$$\mathbb{R}^{s(m-1)+k} \to \mathbb{R}^m, \quad x \mapsto \Big( \sum_{j=0}^{k-1} w_j \, x_{is+j} \Big)_{i=0}^{m-1}. \qquad (10.8)$$

For instance, for $m = 3$, the convolution in (10.7) has filter size $k = 3$ and stride $s = 2$.
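The map (10.8) and its matrix (10.7) can be written out directly. In this sketch, the function names `toeplitz_matrix` and `convolve` and the numerical filter are illustrative choices.

```python
import numpy as np

def toeplitz_matrix(w, stride, m):
    """Generalized Toeplitz matrix of size m x (stride*(m-1) + k) for the filter w."""
    k = len(w)
    T = np.zeros((m, stride * (m - 1) + k))
    for i in range(m):
        T[i, i * stride : i * stride + k] = w
    return T

def convolve(w, stride, x):
    """Apply formula (10.8): output entry i is sum_j w[j] * x[i*stride + j]."""
    k = len(w)
    m = (len(x) - k) // stride + 1
    return np.array(
        [sum(w[j] * x[i * stride + j] for j in range(k)) for i in range(m)]
    )

w = [1.0, 2.0, 3.0]           # hypothetical filter (w_0, w_1, w_2)
T = toeplitz_matrix(w, 2, 3)  # the 3 x 7 matrix from (10.7)
x = np.arange(7.0)
y = convolve(w, 2, x)
```

Multiplying `T` with `x` gives the same vector as `convolve(w, 2, x)`, so the matrix and the sliding inner product describe the same linear map.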
A linear convolutional neural network is the composition of $L$ convolutions with filter
sizes $k = (k_1, k_2, \ldots, k_L)$ and strides $s = (s_1, s_2, \ldots, s_L)$. All activation functions in this
network are the identity. The resulting end-to-end function is also a convolution with filter
size $k = k_1 + \sum_{l=2}^{L} (k_l - 1) \prod_{i=1}^{l-1} s_i$ and stride $s = s_1 s_2 \cdots s_L$. This was shown in [108,
Proposition 2.2]; see also [109, Section 2]. The point is this: any product of generalized
Toeplitz matrices like (10.7) is again a generalized Toeplitz matrix. The parametrization
(10.4) sends the filters of each layer to the filter of the end-to-end convolution:


$$\mu : \mathbb{R}^{k_1} \times \mathbb{R}^{k_2} \times \cdots \times \mathbb{R}^{k_L} \to \mathbb{R}^{k}. \qquad (10.9)$$


The map $\mu$ can be evaluated via polynomial multiplication. This is done as follows. For
any positive integers $k$ and $t$, we write $\mathbb{R}[x^t]_{\leq k-1}$ for the space of univariate polynomials
of degree at most $k - 1$. The variable in these polynomials is the power $x^t$. We identify any
filter of size $k$ with the coefficient vector of such a polynomial via


$$\pi_{t,k} : \mathbb{R}^k \to \mathbb{R}[x^t]_{\leq k-1}, \quad w \mapsto w_0 x^{t(k-1)} + w_1 x^{t(k-2)} + \cdots + w_{k-2} x^{t} + w_{k-1}.$$


The filter $w$ of the end-to-end convolution corresponds to the polynomial $\pi_{1,k}(w)$ that
admits a sparse factorization into polynomials corresponding to the filters $w_1, w_2, \ldots, w_L$:


$$\pi_{1,k}(\mu(w_1, \ldots, w_L)) \;=\; \pi_{t_L,k_L}(w_L) \cdots \pi_{t_2,k_2}(w_2) \cdot \pi_{t_1,k_1}(w_1), \quad t_l = s_1 s_2 \cdots s_{l-1}.$$


The powers $t = (t_1, \ldots, t_L)$ of the factors in the above formula uniquely encode the
strides $s = (s_1, \ldots, s_{L-1})$ of the convolutions. The last stride $s_L$ does not influence the
filter $\mu(w_1, \ldots, w_L)$ of the end-to-end convolution. We summarize our discussion in the
following result, which is found in [109, Proposition 2.2].


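Proposition 10.5 The neuromanifold $\mathcal{M}_{k,t}$ for a linear convolutional neural network with
filter sizes $k = (k_1, k_2, \ldots, k_L)$ and powers $t = (t_1, t_2, \ldots, t_L)$ is parametrized by polynomial
multiplication, namely

$$\begin{aligned}
\mu : \mathbb{R}[x^{t_1}]_{\leq k_1-1} \times \mathbb{R}[x^{t_2}]_{\leq k_2-1} \times \cdots \times \mathbb{R}[x^{t_L}]_{\leq k_L-1} &\to \mathbb{R}[x]_{\leq k-1}, \\
(p_1, p_2, \ldots, p_L) &\mapsto p_1 p_2 \cdots p_L.
\end{aligned} \qquad (10.10)$$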


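Example 10.6 ($L = 2$, $t = (1,2)$) For $k = (3,2)$ and $k = (2,3)$, the matrix products are


$$\begin{bmatrix} v_0 & v_1 \end{bmatrix}
\begin{bmatrix} u_0 & u_1 & u_2 & 0 & 0 \\ 0 & 0 & u_0 & u_1 & u_2 \end{bmatrix}
= \begin{bmatrix} u_0 v_0 & u_1 v_0 & u_0 v_1 + u_2 v_0 & u_1 v_1 & u_2 v_1 \end{bmatrix},$$


$$\begin{bmatrix} v_0 & v_1 & v_2 \end{bmatrix}
\begin{bmatrix} u_0 & u_1 & 0 & 0 & 0 & 0 \\ 0 & 0 & u_0 & u_1 & 0 & 0 \\ 0 & 0 & 0 & 0 & u_0 & u_1 \end{bmatrix}
= \begin{bmatrix} u_0 v_0 & u_1 v_0 & u_0 v_1 & u_1 v_1 & u_0 v_2 & u_1 v_2 \end{bmatrix}.$$


These correspond to multiplying pairs of polynomials, as specified by Proposition 10.5:


$$(u_0 x^2 + u_1 x + u_2)(v_0 x^2 + v_1) \quad \text{and} \quad (u_0 x + u_1)(v_0 x^4 + v_1 x^2 + v_2). \qquad (10.11)$$
The neuromanifold $\mathcal{M}_{k,t}$ consists of polynomials that admit such a factorization. ⋄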
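The correspondence between products of generalized Toeplitz matrices and polynomial multiplication in Example 10.6 can be verified numerically. In this sketch, the helper names `toeplitz_matrix` and `upsample` and the numerical filters are illustrative; `np.polymul` expects coefficients ordered from the highest degree down, which matches the convention of $\pi_{t,k}$.

```python
import numpy as np

def toeplitz_matrix(w, stride, m):
    """Generalized Toeplitz matrix of size m x (stride*(m-1) + k) for the filter w."""
    k = len(w)
    T = np.zeros((m, stride * (m - 1) + k))
    for i in range(m):
        T[i, i * stride : i * stride + k] = w
    return T

def upsample(w, t):
    """Coefficients (highest degree first) of w_0 x^{t(k-1)} + ... + w_{k-1}."""
    p = np.zeros(t * (len(w) - 1) + 1)
    p[::t] = w
    return p

# First case: k = (3, 2), first layer with stride 2.
u, v = [2.0, -1.0, 3.0], [5.0, 7.0]
filter_a = (toeplitz_matrix(v, 1, 1) @ toeplitz_matrix(u, 2, 2))[0]
product_a = np.polymul(upsample(u, 1), upsample(v, 2))

# Second case: k = (2, 3), first layer with stride 2.
u2, v2 = [2.0, -1.0], [5.0, 7.0, 11.0]
filter_b = (toeplitz_matrix(v2, 1, 1) @ toeplitz_matrix(u2, 2, 3))[0]
product_b = np.polymul(upsample(u2, 1), upsample(v2, 2))
```

In both cases the end-to-end filter read off from the matrix product agrees with the coefficient vector of the polynomial product in (10.11).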
For linear convolutional networks, the powers $t = (t_1, t_2, \ldots, t_L)$ satisfy that $t_1 = 1$
and $t_{l-1}$ divides $t_l$ for each $l = 2, \ldots, L$. From the algebraic perspective, it is interesting
to study the polynomial multiplication map (10.10) for arbitrary positive integers $t_l$. Its
image $\mathcal{M}_{k,t}$ lives in the vector space $\mathbb{R}[x]_{\leq k-1} \simeq \mathbb{R}^k$, where $k - 1 = \sum_{l=1}^{L} t_l (k_l - 1)$. By
Tarski's Theorem on Quantifier Elimination, $\mathcal{M}_{k,t}$ is a semialgebraic set, i.e., it can be
described by a Boolean combination of polynomial inequalities.

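As is customary in applied algebraic geometry, we simplify our problem by replacing
$\mathcal{M}_{k,t}$ with its Zariski closure $\mathcal{V}_{k,t}$ in the complex projective space $\mathbb{P}^{k-1}$. Thus, by definition,
the variety $\mathcal{V}_{k,t}$ is the image of the map


$$\mathbb{P}^{k_1-1} \times \mathbb{P}^{k_2-1} \times \cdots \times \mathbb{P}^{k_L-1} \to \mathbb{P}^{k-1}, \quad (p_1, p_2, \ldots, p_L) \mapsto p_1 p_2 \cdots p_L. \qquad (10.12)$$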


Proposition 10.7 The map in (10.12) has no base points, so it is a morphism. Moreover,
the map (10.12) is finite-to-one. If the integers $t_1, t_2, \ldots, t_L$ are pairwise distinct, then the
map is generically one-to-one.


Proof A product of polynomials can only be zero if one of its factors is zero. Hence, 
the projective multiplication map (10.12) is well-defined at all points in its domain. The 
map is finite-to-one because the irreducible factorization of a polynomial is unique, up 
to reordering the factors. If all factors have distinct powers, then the factors of a generic 
polynomial cannot be swapped, and the map is generically one-to-one. □


Corollary 10.8 The variety $\mathcal{V}_{k,t}$ has dimension $k_1 + k_2 + \cdots + k_L - L$, and its degree is


$$\frac{(k_1 + k_2 + \cdots + k_L - L)!}{(k_1 - 1)!\,(k_2 - 1)! \cdots (k_L - 1)!} \quad \text{divided by the size of the general fiber of (10.12).}$$


Proof We consider the variety $\mathbb{P}^{k_1-1} \times \mathbb{P}^{k_2-1} \times \cdots \times \mathbb{P}^{k_L-1}$ in its Segre embedding (see
Definition 12.4). The map (10.12) is the composition of this Segre embedding followed
by a linear projection. That projection has no base points by Proposition 10.7 and thus
preserves dimension. Moreover, the linear projection is finite-to-one and of the same
degree $\delta$ as (10.12). Hence, the degree of its image $\mathcal{V}_{k,t}$ is the degree of the Segre
variety $\mathbb{P}^{k_1-1} \times \cdots \times \mathbb{P}^{k_L-1}$ divided by $\delta$. The degree of the Segre variety is computed in
Corollary 12.21 below. It is the multinomial coefficient in the stated formula. □


Example 10.9 If $k = (2, 2, \ldots, 2)$ and $t = (1, 1, \ldots, 1)$, we simply multiply linear polynomials.
Here $k - 1 = L$. The map (10.12) is onto, by the Fundamental Theorem of Algebra,
and hence $\mathcal{V}_{k,t} = \mathbb{P}^L$. The fiber has cardinality $L!$ since the linear factors can be reordered
arbitrarily. The multinomial coefficient in Corollary 10.8 is also $L!$, so the degree of the
image is 1. The neuromanifold $\mathcal{M}_{k,t}$ is Zariski dense in $\mathbb{R}[x]_{\leq L}$. It consists of all real-rooted
polynomials of degree $\leq L$, so it is an interesting semialgebraic set. ⋄


The size of the general fiber in Corollary 10.8 can be read off from the filter sizes $k$ and
the powers $t$, namely, it is the order of the symmetry group of the factorization (10.10).
Such a symmetry arises whenever $t_i = t_j$ for $i \neq j$. In this case, we can merge the factors
$\mathbb{R}[x^{t_i}]_{\leq k_i-1}$ and $\mathbb{R}[x^{t_j}]_{\leq k_j-1}$ into a single factor $\mathbb{R}[x^{t_i}]_{\leq k_i+k_j-2}$ without changing the image
variety $\mathcal{V}_{k,t}$. Therefore, for computing the degree or other properties of $\mathcal{V}_{k,t}$, we can always
assume that $t$ has distinct coordinates. For example, suppose that $L = 6$, $k = (5, 4, 3, 7, 6, 3)$
and $t = (1, 5, 5, 5, 3, 3)$. Then $\mathcal{V}_{k,t} \subset \mathbb{P}^{80}$ has dimension $22 = \sum_{i=1}^{6} (k_i - 1)$. After merging
as described above, our new parameter vectors are $k = (5, 12, 8)$ and $t = (1, 5, 3)$. Using
Proposition 10.7, we now see that the degree of $\mathcal{V}_{k,t}$ equals $\frac{22!}{4!\,11!\,7!} = 232792560$.
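The merging procedure and the degree formula of Corollary 10.8 are easy to automate. The sketch below uses only the Python standard library; the function names are illustrative, and the degree is computed under the assumption, justified by Proposition 10.7, that the general fiber has size one after merging factors with equal powers.

```python
from math import factorial

def merge_equal_powers(k, t):
    """Merge factors with the same power t_l; degrees k_l - 1 add up."""
    degree_by_power = {}
    for k_l, t_l in zip(k, t):
        degree_by_power[t_l] = degree_by_power.get(t_l, 0) + (k_l - 1)
    merged_t = list(degree_by_power)  # powers in order of first appearance
    merged_k = [deg + 1 for deg in degree_by_power.values()]
    return merged_k, merged_t

def invariants_of_V(k, t):
    """Return (dimension, ambient projective dimension, degree) of V_{k,t}."""
    ambient = sum(t_l * (k_l - 1) for k_l, t_l in zip(k, t))
    merged_k, _ = merge_equal_powers(k, t)
    dim = sum(k_l - 1 for k_l in merged_k)
    degree = factorial(dim)
    for k_l in merged_k:
        degree //= factorial(k_l - 1)  # multinomial coefficient, step by step
    return dim, ambient, degree

example = invariants_of_V((5, 4, 3, 7, 6, 3), (1, 5, 5, 5, 3, 3))
```

For the example in the text this reproduces dimension 22 and degree 232792560, and for $k = (2, \ldots, 2)$, $t = (1, \ldots, 1)$ it returns degree 1, in agreement with Example 10.9.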
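Equipped with Corollary 10.8, we explore the prime ideal of $\mathcal{V}_{k,t}$ in some small cases.
This ideal lives in the polynomial ring $\mathbb{Q}[c_0, c_1, \ldots, c_{k-1}]$ where $p(x) = \sum_{i=0}^{k-1} c_i x^i$ is in the
codomain of (10.10). We start with Example 10.6. The two varieties are cubic threefolds,
in $\mathbb{P}^4$ resp. $\mathbb{P}^5$. Their prime ideals are


$$\langle\, c_0 c_3^2 + c_1^2 c_4 - c_1 c_2 c_3 \,\rangle \quad \text{and} \quad \langle\, c_0 c_3 - c_1 c_2,\; c_0 c_5 - c_1 c_4,\; c_2 c_5 - c_3 c_4 \,\rangle. \qquad (10.13)$$


The second ideal shows that $\mathcal{V}_{(2,3),(1,2)} = \mathbb{P}^1 \times \mathbb{P}^2 \subset \mathbb{P}^5$. The first ideal defines the variety
$\mathcal{V}_{(3,2),(1,2)}$, which is a projection of that Segre threefold from $\mathbb{P}^5$ into $\mathbb{P}^4$.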
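One can confirm that the generators in (10.13) vanish on the images of the two parametrizations in (10.11). The sketch below does this in exact integer arithmetic at random integer points, using only the standard library; the helper names are illustrative. Vanishing at random points is evidence for membership in the ideal, not a proof that the generators generate the full prime ideal.

```python
import random

def polymul(p, q):
    """Multiply polynomials given as coefficient lists, lowest degree first."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def cubic(c):
    """Generator of the first ideal in (10.13), for c = (c_0, ..., c_4)."""
    return c[0] * c[3] ** 2 + c[1] ** 2 * c[4] - c[1] * c[2] * c[3]

def segre_minors(c):
    """Generators of the second ideal in (10.13), for c = (c_0, ..., c_5)."""
    return [
        c[0] * c[3] - c[1] * c[2],
        c[0] * c[5] - c[1] * c[4],
        c[2] * c[5] - c[3] * c[4],
    ]

random.seed(0)
values = []
for _ in range(20):
    u0, u1, u2, v0, v1, v2 = (random.randint(-9, 9) for _ in range(6))
    # k = (3, 2): (u0 x^2 + u1 x + u2)(v0 x^2 + v1)
    values.append(cubic(polymul([u2, u1, u0], [v1, 0, v0])))
    # k = (2, 3): (u0 x + u1)(v0 x^4 + v1 x^2 + v2)
    values.extend(segre_minors(polymul([u1, u0], [v2, 0, v1, 0, v0])))
```

All entries of `values` are zero, whereas a point such as $c = (1, 0, 0, 1, 0)$ gives a nonzero value of the cubic and hence lies outside $\mathcal{V}_{(3,2),(1,2)}$.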


Example 10.10 ($L = 3$, $t = (1, 2, 2)$) We extend the polynomials from (10.11) by multiplying
them with a sparse quadratic factor:


$$\begin{aligned}
p(x) &= (u_2 + u_1 x + u_0 x^2)(v_1 + v_0 x^2)(w_1 + w_0 x^2) && \text{for } k = (3, 2, 2), \\
\text{and} \quad p(x) &= (u_1 + u_0 x)(v_2 + v_1 x^2 + v_0 x^4)(w_1 + w_0 x^2) && \text{for } k = (2, 3, 2).
\end{aligned}$$


The resulting varieties $\mathcal{V}_{k,t}$ live in $\mathbb{P}^6$ and $\mathbb{P}^7$ respectively. We can understand them by
examining the factors of $p(x) + p(-x)$ and $p(x) - p(-x)$. Both varieties are 4-dimensional
and they admit nice determinantal representations. The ideal of $\mathcal{V}_{(3,2,2),(1,2,2)}$ is generated
by the four $3 \times 3$ minors of the $3 \times 4$ matrix


$$\begin{bmatrix}
c_0 & c_2 & c_4 & c_6 \\
c_1 & c_3 & c_5 & 0 \\
0 & c_1 & c_3 & c_5
\end{bmatrix}.$$


While the parametrization for $k = (2, 3, 2)$ is not a monomial map, the variety $\mathcal{V}_{(2,3,2),(1,2,2)}$
is still toric. Its prime ideal is generated by the six $2 \times 2$ minors of the $2 \times 4$ matrix


$$\begin{bmatrix}
c_0 & c_2 & c_4 & c_6 \\
c_1 & c_3 & c_5 & c_7
\end{bmatrix}.$$


We conclude that $\mathcal{V}_{(2,3,2),(1,2,2)}$ is the Segre embedding of $\mathbb{P}^1 \times \mathbb{P}^3$ into $\mathbb{P}^7$. ⋄


These examples suggest that our varieties $\mathcal{V}_{k,t}$ are interesting objects for further study in
combinatorial commutative algebra. From the perspective of metric algebraic geometry, one
should pursue the concepts introduced in the previous chapters. One of these is Euclidean
distance optimization. Computations show that the two varieties in Example 10.6 have ED
degrees 10 and 2, while the varieties in Example 10.10 have ED degrees 23 and 2. Can we
find a general formula for the ED degree of $\mathcal{V}_{k,t}$ in terms of $k$ and $t$?

We turn our attention to describing the singular locus of the variety $\mathcal{V}_{k,t}$ that represents
convolutional networks. As argued above, we can restrict ourselves to the case when the
powers $t = (t_1, \ldots, t_L)$ are pairwise distinct integers. The singular locus was determined
for those varieties $\mathcal{V}_{k,t}$ that come from linear convolutional networks (i.e., under the
assumption that $t_{l-1} \mid t_l$) in [109, Theorem 2.8]:

Theorem 10.11 Let $1 = t_1 < t_2 < \cdots < t_L$ be integers such that $t_{l-1} \mid t_l$ for all $l = 2, \ldots, L$.
The singular locus of $\mathcal{V}_{k,t}$ consists of all its proper subvarieties $\mathcal{V}_{k',t}$.


This result says that the neuromanifold is singular along smaller neuromanifolds of 
convolutional networks with the same strides. Hence, as in the case of linear fully connected 
networks, discussed in Section 10.1, the singular locus of the neuromanifold is parametrized 
by smaller network architectures. 

In the remainder of this section, we discuss critical points of training linear convolutional
networks. For that, we have to return to the semialgebraic neuromanifold $\mathcal{M}_{k,t}$. This
set is closed in the Euclidean topology. Describing its Euclidean relative boundary is a
challenging problem, studied in [109, Section 6].

When all strides are one (i.e., $t = (1, \ldots, 1)$), the neuromanifold $\mathcal{M}_{k,t}$ is a full-dimensional
semialgebraic subset of the ambient space $\mathbb{R}^k$. Here, critical points of the
loss function often correspond to points on the Euclidean boundary of $\mathcal{M}_{k,t}$. These are
critical points of the network parametrization map $\mu$. This stands in sharp contrast to
convolutional networks where all strides are larger than one (i.e., $t_1 < t_2 < \cdots$) and $\mathcal{M}_{k,t}$ is a
lower-dimensional subset of $\mathbb{R}^k$. The following result is from [109, Theorem 2.11].


Theorem 10.12 Consider the map $\mu$ in (10.10) with integers $1 = t_1 < t_2 < \cdots < t_L$ such
that $t_{l-1} \mid t_l$ for all $l = 2, \ldots, L$. Let $d \geq k$, and let $\ell_D$ be the squared-error loss (10.5) for
the end-to-end convolution $W$. For almost all $d$-tuples $D$ of training data, every critical
point $\theta$ of $\mathcal{L} = \ell_D \circ \mu$ satisfies:


(a) either $\mu(\theta) = 0$, or
(b) $\theta$ is a regular point of $\mu$ and $\mu(\theta)$ lies in the smooth locus of $\mathcal{V}_{k,t}$ and the Euclidean
interior of $\mathcal{M}_{k,t}$.


We summarize our discussion on the training of linear networks with the squared-error 
loss. For fully connected networks as in Section 10.1, the neuromanifold M is a classical 
determinantal variety. Its Euclidean relative boundary is empty. Nevertheless, spurious 
critical points commonly appear; they correspond to singular points of M. For convolu- 
tional networks of stride one, M is semialgebraic, Euclidean closed, and full-dimensional. 
Here, the singular locus (of its Zariski closure) is empty, but often critical points are on its 
Euclidean boundary and are thus critical points of $\mu$. Finally, for convolutional networks
with all strides larger than one, M is a semialgebraic, Euclidean closed, lower-dimensional 
subset. Its Euclidean relative boundary and its singular locus are usually nontrivial. Never- 
theless, these loci are not relevant for training the network when using a sufficient amount 
of generic data: All critical points — except when a filter in one of the layers is zero — are 
pure and correspond to interior smooth points of M. 


Remark 10.13 In this section, we studied convolutions on one-dimensional signals (i.e., 
vectors). Many practical neural networks, notably in image processing, use convolutions 
on two-dimensional signals. Higher-dimensional convolutions move a filter tensor through 
an input tensor. The composition of such convolutions corresponds to the multiplication 
of multivariate polynomials [108, Section 4.3].


10.3 Learning Varieties


Suppose that we are given $d$ data points $D = \{x_1, \ldots, x_d\} \subset \mathbb{R}^n$. We assume the variety
hypothesis. This means we assume that the points in $D$ are (possibly noisy) samples from
a real algebraic variety $X \subset \mathbb{R}^n$. The goal is to learn the variety $X$ from the data $D$. The
unknown variety $X$ is represented by the ideal $I(X)$ of all polynomials that vanish on $X$.
Assuming $X$ to be irreducible, the ideal $I(X)$ is prime. However, $I(X)$ is unknown. All we
are given is the finite set $D$. In this section, we discuss strategies to learn an ideal $I_D$ which
is meant to approximate $I(X)$. Our discussion is an invitation to the article [26].


Remark 10.14 In some situations, one is interested in specific features of X, such as its 
dimension or homology. In this case, learning the ideal means asking for more information 
than what is actually needed. Other strategies can be more efficient: Theorem 15.2 in 
Chapter 15 gives conditions under which one can recover the homology of X from the 
finite sample $D$ by computing the Čech complex of a union of balls. For estimating the
dimension of X, see [26, Section 3] and the vast literature on data dimensionality. 


We now focus on learning the approximate ideal $I_D$. First, we need to find an appropriate
loss function. One immediate option is to use the mean squared loss function
$\ell_D(f) = \frac{1}{d} \sum_{i=1}^{d} f(x_i)^2$. Next, we must define a threshold $\varepsilon > 0$ so that we keep only those
polynomials for which $\ell_D(f) \leq \varepsilon$. Moreover, since $f \in I(X)$ if and only if $\lambda \cdot f \in I(X)$
for every nonzero $\lambda \in \mathbb{R}$, we must work with some kind of normalization for $f$. The next example
illustrates one approach to computing the approximate ideal $I_D$ using linear algebra.


Example 10.15 Suppose $n = 2$ and that we have $d = 3$ data points $D = \{x, y, z\} \subset \mathbb{R}^2$.
Given $\varepsilon > 0$ we want to compute polynomials $f \in \mathbb{R}[x_1, x_2]$ of degree at most two that


(a) have mean squared loss at most $\varepsilon > 0$, and
(b) satisfy the normalization


$$f_1^2 + \cdots + f_6^2 = 1, \quad \text{where } f = f_1 x_1^2 + f_2 x_1 x_2 + f_3 x_2^2 + f_4 x_1 + f_5 x_2 + f_6.$$


For this, we set up the bivariate Vandermonde matrix of degree $\leq 2$ for our data points:


$$M_D \;=\; \begin{bmatrix}
x_1^2 & x_1 x_2 & x_2^2 & x_1 & x_2 & 1 \\
y_1^2 & y_1 y_2 & y_2^2 & y_1 & y_2 & 1 \\
z_1^2 & z_1 z_2 & z_2^2 & z_1 & z_2 & 1
\end{bmatrix} \in \mathbb{R}^{3 \times 6}. \qquad (10.14)$$


The mean squared loss equals $\frac{1}{d} \sum_{i=1}^{d} f(x_i)^2 = \frac{1}{d} \|M_D f\|^2$. Suppose that $f \in \mathbb{R}^6$ is a
right-singular vector of $M_D$ with singular value $\sigma$. Then, $\|M_D f\|^2 = \sigma^2$. This means one
can compute the singular value decomposition of $M_D$ and keep those right-singular vectors
of $M_D$ whose squared singular values are at most $d \cdot \varepsilon$. (Caveat: this does not yield all
polynomials with mean squared loss at most $\varepsilon$.) The polynomials corresponding to these
singular vectors generate an ideal, which is a candidate for $I_D$. ⋄
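The procedure of Example 10.15 can be sketched with NumPy as follows. The sample of 50 noise-free points on the unit circle, the tolerance, and the function names are hypothetical choices for illustration. When there are fewer points than monomials, the right-singular vectors spanning the kernel of $M_D$ have no listed singular value, so the sketch treats their singular values as zero.

```python
import numpy as np

def vandermonde_deg2(points):
    """Bivariate Vandermonde matrix with columns x1^2, x1*x2, x2^2, x1, x2, 1."""
    x1, x2 = points[:, 0], points[:, 1]
    return np.column_stack([x1**2, x1 * x2, x2**2, x1, x2, np.ones(len(points))])

def approximate_vanishing_polys(points, eps):
    """Right-singular vectors of M_D whose squared singular value is at most d * eps."""
    M = vandermonde_deg2(points)
    d = len(points)
    _, s, Vt = np.linalg.svd(M, full_matrices=True)
    squared = np.zeros(Vt.shape[0])
    squared[: len(s)] = s**2  # missing singular values are zero
    return Vt[squared <= d * eps]

# Hypothetical data: 50 exact samples from the unit circle x1^2 + x2^2 - 1 = 0.
angles = np.linspace(0.0, 2.0 * np.pi, 50, endpoint=False)
points = np.column_stack([np.cos(angles), np.sin(angles)])
polys = approximate_vanishing_polys(points, eps=1e-10)
```

For this sample a single coefficient vector is returned, and up to sign it is the normalized coefficient vector $(1, 0, 1, 0, 0, -1)/\sqrt{3}$ of the circle equation.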


The approach from Example 10.15 can be generalized to any ambient dimension $n$ and
any number of points $d$. The matrix in (10.14) is called a multivariate Vandermonde matrix.
In the general setting, it is a $d \times m$ matrix, where $m$ is the number of monomials we are
considering for our polynomials in $I_D$. In Example 10.15, all monomials of degree at most
two are used. In general, the choice of monomials is a hyperparameter for this approach.
This means that we have to specify the monomials before solving for $f$.

In Example 10.15, we used the singular value decomposition for computing polynomials
that generate our ideal $I_D$. Two alternative methods are discussed in [26, Section 5],
namely the QR decomposition and the row echelon form of the Vandermonde matrix. All
three methods have their advantages and disadvantages.

In other situations, one might have prior knowledge on the polynomials that generate
$I(X)$. This might come from geometric considerations (as in [26, Section 2]). These might
suggest determinantal representations, or torus actions on $X$, which ensure that $I(X)$
is homogeneous with respect to some multigrading. This amounts to information about
sparsity. Sometimes, we have a priori bounds for the degrees of the generators of $I(X)$.
The "variety of varieties" in [26, Section 2.2] suggests many options.

In order to search for sparse polynomials, we can utilize methods from compressed sens- 
ing. This field is concerned with computing sparse solutions to underdetermined systems 
of linear equations. Moreover, polynomials coming from specific applications often have 
rational or integer coefficients, so one might seek polynomials that also have this property. 
For this, finite field techniques can be very useful. 

For the normalization of the polynomials in Example 10.15, we used the Euclidean
norm of the vector of coefficients. An alternative approach is to fix a monomial ordering
on $\mathbb{R}[x_1, \ldots, x_n]$ (see Chapter 3), and then require the leading coefficient of every $f$ to
be 1. This leads to the following definition.


Definition 10.16 Let $D = \{x_1, \ldots, x_d\} \subset \mathbb{R}^n$ be a finite sample of points and $\varepsilon > 0$. The
$\varepsilon$-approximate vanishing ideal over $D$ is


$$I_D^{\varepsilon} \;:=\; \Big\langle\, f \;\Big|\; \frac{1}{d} \sum_{i=1}^{d} f(x_i)^2 \leq \varepsilon \text{ and the leading coefficient of } f \text{ is } 1 \,\Big\rangle.$$


We always have the inclusion $I(X) \subset I_D^{\varepsilon}$. In the limit, when the sample $D$ consists
of all points in $X$ and $\varepsilon = 0$, we have $I_D^{\varepsilon} = I(X)$. However, it is not known under which
conditions the dimension of the variety of $I_D^{\varepsilon}$ is equal to the dimension of $X$, or how their
degrees are related (if at all). There is a vast literature on the approximate vanishing ideal
and how to compute it. We refer to [148] and the references therein.

Wirth and Pokutta [148] present a novel algorithm for computing the approximate 
vanishing ideal $I_D^{\varepsilon}$, given any tolerance $\varepsilon > 0$. In contrast to the approach from Exam- 
ple 10.15, for their algorithm, they do not have to specify monomials; i.e., monomials are 
not hyperparameters. The algorithm of [148] returns sparse generators for the ideal $I_D^{\varepsilon}$. 

Both approaches presented above use the mean squared loss that comes from evaluating 
polynomials. A more geometric choice for the loss would be $\ell_D = \frac{1}{d}\sum_{i=1}^{d} \|x_i - X\|^2$, where 
$\|x - X\|$ is the Euclidean distance from x to X. This idea has not been worked out, neither 
from a theoretical nor from an algorithmic perspective. This desirable aim underscores the 
relevance of the ED problem (2.1) for machine learning. 

Remark 10.17 In Chapter 15, we present methods for sampling points from algebraic 
varieties given their equations. Thus, once we have learned equations, we can set up a 
generative model that produces synthetic data. For instance, Figure 15.4 shows points that 
have been generated near the Trott curve. For enlightening varieties in higher dimensions, 
see [26, Section 2.2]. One data set D to start with is [26, Example 2.13]. 


We now shift gears. In the remainder of this section, we discuss how machine learning 
can be helpful for theoretical research in algebraic geometry. Several articles have explored 
this theme, and we shall offer a brief guide to this emerging literature. However, it is not 
clear yet what type of information can be learned and which type of tasks can be solved by 
training neural networks. Despite its massive success in classification tasks (e.g., “Is there 
a cat in this picture?”), machine learning with neural networks has not improved explicitly 
geometric tasks such as solving systems of polynomial equations. For instance, the problem 
of reconstructing 3D scenes from images taken by unknown cameras (see Section 13.3) 
amounts to solving certain polynomial systems. There have been numerous attempts to 
solve that problem with machine learning methods. However, none of them have been as 
successful as traditional symbolic techniques with Gröbner bases or resultants [158, 181]. 

The works that have used machine learning techniques to answer questions in algebraic 
geometry come roughly in two flavors. On the one hand, machine learning has been used to 
directly compute geometric properties, e.g. of Hilbert series [12], of irreducible represen- 
tations [47], or numerical Calabi–Yau (Ricci-flat Kähler) metrics [59]. Those approaches 
trade off the reliability of the output with performance, and this can yield valuable insights 
into problem instances that lie outside of the scope of traditional techniques. 

Many algebro-geometric algorithms depend on a heuristic that has to be chosen by the 
user and that might heavily influence their performance. For instance, to compute a Gröb- 
ner basis, a monomial ordering has to be chosen. Machine learning has been successful at 
predicting such a heuristic, which is then used to speed up the computation using traditional 
algorithms. In that way, the performance can be enhanced without compromising the relia- 
bility of the final output. This approach has been used to speed up Buchberger’s Algorithm 
by learning S-pair selection strategies [144], and Cylindrical Algebraic Decomposition by 
learning a variable ordering and by exploring Gröbner basis preconditioning [97]. 

In a similar spirit, the computation of periods of hypersurfaces is enhanced in [84]. 
The authors study pencils of hypersurfaces, and they use neural networks to predict the 
complexity of the Gauss–Manin connection for the period matrix along the pencil. Based on 
that prediction, they explore all smooth quartic surfaces in $\mathbb{P}^3$ whose polynomials are sums 
of five monomials, and they guess when their periods are computable by Gauss–Manin. 
This leads them to determine the periods of 96% of those surfaces. 

Although neural networks have not shown great potential for solving systems of poly- 
nomials, they can be trained to predict their number of real solutions [18,27]. Such a 
prediction might be used by homotopy methods that track real solutions only. This assumes 
that one has pre-computed a starting system for each possible number of real solutions, and 
— ideally — one for each chamber of the real discriminant. 

That would yield a reliable numerical computation that requires less computation time 
since it tracks fewer paths. A more drastic approach was taken in [95] where the authors 
learn a single starting solution for a real homotopy that has a good chance of reaching a 
good solution of the desired target system. That way of computing produces a less reliable 
solution, but since they propose their method as part of a random sample consensus 
(RANSAC) scheme, bad solutions can be detected and disregarded. Since tracking a single 
solution can be very fast, they can simply repeat their approach for each bad solution. 

Chapter 11 
Maximum Likelihood 


In Chapter 2, we discussed the problem of minimizing the Euclidean distance from a data 
point u to a model X in $\mathbb{R}^n$ that is described by polynomial equations. In Chapter 5, we 
studied the analogous problem in the setting of algebraic statistics [167], where the model X 
represents a family of probability distributions, and we used the Wasserstein metric to 
measure the distance from u to X. Finally, in Chapter 9, we considered the distance from a 
polynomial system u to the discriminant in the context of numerical analysis. 

In this chapter, we return to statistical models, but we now replace the Wasserstein 
distance with the Kullback-Leibler (KL) divergence. This will be defined in Section 11.1. 
We show that minimizing KL divergence is equivalent to maximum likelihood estimation. 
In Section 11.2, we introduce the maximum likelihood (ML) degree, which is an analogue 
to the ED degree from Chapter 2. An interesting connection to physics is featured in 
Section 11.3, and in Section 11.4, we study the ML degree for Gaussian models. 


11.1 Kullback-Leibler Divergence 


The two scenarios of most interest for statisticians are Gaussian models and discrete models. 
We start with discrete models, where we take as the state space the finite set {0, 1, ..., n}. 
In Chapter 5, we have worked with the closed probability simplex $\Delta_n$. Here, we use the 
open simplex of probability distributions, and we denote it by 

$$\Delta_n^{\circ} \;:=\; \bigl\{\, p = (p_0, p_1, \ldots, p_n) \in \mathbb{R}^{n+1} \;\big|\; p_0 + p_1 + \cdots + p_n = 1 \ \text{ and } \ p_0, p_1, \ldots, p_n > 0 \,\bigr\}.$$


Definition 11.1 The Kullback-Leibler (KL) divergence of two distributions $q, p \in \Delta_n^{\circ}$ is 

$$D_{\mathrm{KL}}(q\|p) \;=\; \sum_{i=0}^{n} q_i \log(q_i/p_i). \tag{11.1}$$

The KL divergence is not symmetric, i.e. $D_{\mathrm{KL}}(q\|p) \neq D_{\mathrm{KL}}(p\|q)$ holds in general. 
Nevertheless, we interpret KL divergence as a kind of metric on the open simplex $\Delta_n^{\circ}$. 
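
The asymmetry is easy to observe numerically. The following sketch evaluates (11.1) in both orders; the function name `kl_divergence` and the two distributions are illustrative assumptions.

```python
from math import log

def kl_divergence(q, p):
    """D_KL(q || p) = sum_i q_i * log(q_i / p_i) for strictly positive q and p."""
    return sum(qi * log(qi / pi) for qi, pi in zip(q, p))

# Two hypothetical distributions on three states.
q = [0.5, 0.3, 0.2]
p = [0.2, 0.5, 0.3]

d_qp = kl_divergence(q, p)   # divergence of q from p
d_pq = kl_divergence(p, q)   # divergence of p from q, generally different
d_qq = kl_divergence(q, q)   # divergence of q from itself
```

Both divergences are positive but they differ, and the divergence of q from itself vanishes, in line with Lemma 11.2.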

The definition of KL divergence has the following background. Fix a random variable Z 
on the set {0, 1, ..., n} with probability distribution q. In information theory, the quantity 
$H = -\sum_{i=0}^{n} q_i \cdot \log(q_i)$ is called the entropy of q. The information content of the event 
Z = i equals $\log(1/q_i) = -\log(q_i)$, so H is the expected value of the information content. 

We explain the motivation for this definition. The probability of i and its information 
content are inversely proportional: the more likely i, the smaller is its information content. 
We anticipate events with high probability. Hence, we gain little information once we 
observe such an event. On the other hand, if we observe an event of small probability, then 
we should receive comparably more information. The entropy of q is the expected value 
of the information content, and the KL divergence $D_{\mathrm{KL}}(q\|p)$ is the expected loss of 
information content, when we approximate q by p. 


Lemma 11.2 The KL divergence is nonnegative. It is zero if and only if the two distributions 
agree. In symbols, $D_{\mathrm{KL}}(q\|p) \ge 0$ for all $p, q \in \Delta_n^{\circ}$, with equality if and only if p = q. 


Proof We use the calculus fact that the function $x \mapsto (x - 1) - \log(x)$ is nonnegative 
for $x \in \mathbb{R}_{>0}$. Its only zero occurs at x = 1. The sum in (11.1) is bounded below as follows: 

$$D_{\mathrm{KL}}(q\|p) \;=\; -\sum_{i=0}^{n} q_i \log(p_i/q_i) \;\ge\; \sum_{i=0}^{n} q_i\,(1 - p_i/q_i) \;=\; \sum_{i=0}^{n} q_i \,-\, \sum_{i=0}^{n} p_i \;=\; 0.$$

Moreover, equality holds if and only if $p_i/q_i = 1$ for all indices i. □ 


Our statistical model is a subset X of the probability simplex $\Delta_n^{\circ}$. That subset is defined 
by homogeneous polynomial equations. As before, for venturing beyond linear algebra, 
we identify X with its Zariski closure in complex projective space $\mathbb{P}^n$. In this chapter, 
we present the algebraic approach to maximum likelihood estimation for X. Our sources 
include the articles [41, 63,90, 100, 101, 167] and references therein. 

Suppose we are given N i.i.d. samples. This data is summarized in the data vector 
$u = (u_0, u_1, \ldots, u_n)$, whose ith coordinate $u_i$ is the number of samples that were in state i. 
We assume that $u_i > 0$ for all i. The sample size is $N = |u| := u_0 + u_1 + \cdots + u_n$. The 
associated log-likelihood function equals 

$$\ell_u : \Delta_n^{\circ} \to \mathbb{R}, \qquad p \;\mapsto\; u_0 \cdot \log(p_0) + u_1 \cdot \log(p_1) + \cdots + u_n \cdot \log(p_n).$$

Performing ML estimation for the model X means solving the optimization problem: 

    Maximize $\ell_u(p)$ subject to $p \in X$.    (11.2) 


If we write $q = \frac{1}{N} u$ for the empirical distribution in $\Delta_n^{\circ}$ corresponding to u, then the 
maximum likelihood estimation problem (11.2) is equivalent to: 

    Minimize $D_{\mathrm{KL}}(q\|p)$ subject to $p \in X$, where $q = \frac{1}{N} u$.    (11.3) 

This holds because the KL divergence can be written as the negative entropy of q minus the 
log-likelihood function scaled by 1/N: 

$$D_{\mathrm{KL}}(q\|p) \;=\; -H - \tfrac{1}{N}\,\ell_u(p) \;=\; \sum_{i=0}^{n} q_i \log(q_i) \;-\; \tfrac{1}{N}\,\ell_u(p).$$

Therefore, viewed through the lens of metric algebraic geometry, this problem amounts 
to minimizing a certain distance, namely KL divergence, to the variety X. The objective 
function in the optimization problem (11.2) involves logarithms and is not algebraic. 
However, each of its partial derivatives is a rational function, and therefore we can study 
this problem using algebraic geometry. 
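
The equivalence of (11.2) and (11.3) can be checked numerically on a toy instance. In this sketch the data vector `u` and the finite list of candidate distributions are assumptions made for illustration; a real model X is a variety, not a finite list.

```python
from math import log, isclose

def log_likelihood(u, p):
    """l_u(p) = sum_i u_i * log(p_i)."""
    return sum(ui * log(pi) for ui, pi in zip(u, p))

def kl_divergence(q, p):
    """D_KL(q || p) = sum_i q_i * log(q_i / p_i)."""
    return sum(qi * log(qi / pi) for qi, pi in zip(q, p))

def entropy(q):
    """H = -sum_i q_i * log(q_i)."""
    return -sum(qi * log(qi) for qi in q)

u = [4, 2, 3, 1]                 # hypothetical counts
N = sum(u)
q = [ui / N for ui in u]         # empirical distribution

# Hypothetical candidates standing in for points of a model X.
candidates = [[0.4, 0.2, 0.3, 0.1], [0.25, 0.25, 0.25, 0.25], [0.1, 0.2, 0.3, 0.4]]

# Identity D_KL(q || p) = -H - l_u(p) / N at every candidate.
identity_holds = all(
    isclose(kl_divergence(q, p), -entropy(q) - log_likelihood(u, p) / N, abs_tol=1e-12)
    for p in candidates
)

best_by_kl = min(candidates, key=lambda p: kl_divergence(q, p))
best_by_likelihood = max(candidates, key=lambda p: log_likelihood(u, p))
```

Since the entropy term does not depend on p, the two selections coincide for any candidate set.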


Example 11.3 The independence model for two binary random variables is a quadratic sur- 
face X in the tetrahedron $\Delta_3^{\circ}$. We studied this model under the optimization of a Wasserstein 
distance in Theorem 5.14. The model X is described by the constraints 

$$\det\begin{pmatrix} p_0 & p_1 \\ p_2 & p_3 \end{pmatrix} = 0 \quad\text{and}\quad p_0 + p_1 + p_2 + p_3 = 1 \quad\text{and}\quad p_0, p_1, p_2, p_3 > 0. \tag{11.4}$$

The data for this model takes the form of a 2 x 2 matrix 

$$u \;=\; \begin{pmatrix} u_0 & u_1 \\ u_2 & u_3 \end{pmatrix}.$$

The sample size of the data is $|u| = u_0 + u_1 + u_2 + u_3$. 

Minimizing the KL divergence from u to the quadratic surface X means solving the 
optimization problem (11.3). This is equivalent to solving the constrained optimization 
problem (11.2). To do this, we apply Lagrange multipliers to the constraints in (11.4), 
bearing in mind that the gradient of the objective function $\ell_u(p)$ equals 

$$\nabla \ell_u(p) \;=\; (u_0/p_0,\; u_1/p_1,\; u_2/p_2,\; u_3/p_3)^{\top}.$$


We solve the Lagrange multiplier equations, and we find that they have a unique solution $\hat p$. 
This unique critical point is the maximum likelihood estimate for the model X given the 
data u. Its coordinates are 

$$\begin{aligned} \hat p_0 &= |u|^{-2}\,(u_0+u_1)(u_0+u_2), & \hat p_1 &= |u|^{-2}\,(u_0+u_1)(u_1+u_3), \\ \hat p_2 &= |u|^{-2}\,(u_2+u_3)(u_0+u_2), & \hat p_3 &= |u|^{-2}\,(u_2+u_3)(u_1+u_3). \end{aligned} \tag{11.5}$$

In words, we multiply the row sums with the column sums of $\frac{1}{|u|} u$. ⋄ 
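
Formula (11.5) can be evaluated exactly with rational arithmetic. The sketch below uses a hypothetical data table; `independence_mle` is a name introduced here for illustration.

```python
from fractions import Fraction

def independence_mle(u):
    """MLE (11.5) for a 2 x 2 table u = [[u0, u1], [u2, u3]] of positive counts."""
    total = sum(sum(row) for row in u)
    row_sums = [sum(row) for row in u]
    col_sums = [sum(col) for col in zip(*u)]
    # Entry (i, j) is (row sum i) * (column sum j) / |u|^2.
    return [[Fraction(r * c, total * total) for c in col_sums] for r in row_sums]

u = [[10, 20], [30, 40]]            # hypothetical data, |u| = 100
p_hat = independence_mle(u)

total_mass = sum(sum(row) for row in p_hat)
determinant = p_hat[0][0] * p_hat[1][1] - p_hat[0][1] * p_hat[1][0]
```

The estimate satisfies both constraints in (11.4): its entries sum to one and its determinant vanishes.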


Let $X \subset \mathbb{P}^n$ be any fixed real projective variety, viewed as a statistical model as above. 
For any given data vector u, we write $\hat p$ for the optimal solution to our optimization problem 
(11.2)–(11.3) in the statistical model $X \cap \Delta_n^{\circ}$. Note that $\hat p$ is an algebraic function of u. In 
the next section, we study the algebraic geometry of the function $u \mapsto \hat p$. A key player will 
be the very affine variety $X^{\circ}$; see Theorem 11.7. 


11.2 Maximum Likelihood Degree 


We fix a real projective variety X in $\mathbb{P}^n$, and we consider the problem in (11.2) or (11.3). 

Definition 11.4 The maximum likelihood degree (ML degree) of the variety X is defined 
to be the number of complex critical points of the optimization problem (11.2) for generic 
data u. We denote it by MLdegree(X). 


For arbitrary data u, the optimal solution in the statistical model $X \cap \Delta_n^{\circ}$ is called the 
maximum likelihood estimate (MLE) of the model X for the data u. It is denoted by $\hat p$. Thus, 
the ML degree is the analogue to the ED degree, where KL divergence now replaces the 
Euclidean distance. The ML degree measures the algebraic complexity of the MLE. 

The critical equations for (11.2) are similar to those of the ED problem in Chapter 2. 
We shall now describe these equations. Let $I_X = \langle f_1, \ldots, f_k \rangle$ be the homogeneous ideal 
of the model X. In addition, we consider the inhomogeneous linear polynomial 
$f_0 := p_0 + p_1 + \cdots + p_n - 1$. Let $\mathcal{J} = (\partial f_i/\partial p_j)$ denote the Jacobian matrix of size (k+1) x (n+1) 
for these polynomials, and set c = codim(X). Following Chapters 2 and 5, the augmented 
Jacobian $\mathcal{AJ}$ is obtained from the Jacobian matrix $\mathcal{J}$ by prepending one more row, 
namely the gradient of the objective function $\nabla \ell_u = (u_0/p_0, u_1/p_1, \ldots, u_n/p_n)^{\top}$. To 
obtain the critical ideal, we enlarge $I_X$ by the (c+2) x (c+2) minors of the (k+2) x (n+1) 
matrix $\mathcal{AJ}$, then we clear denominators, and finally we remove extraneous components 
by saturation. Thus, MLdegree(X) is the degree of the critical ideal. 


Example 11.5 (Space curves) Let n = 3 and X be the curve in $\Delta_3^{\circ}$ defined by two general 
polynomials $f_1$ and $f_2$ of degrees $d_1$ and $d_2$ in $p_0, p_1, p_2, p_3$. The augmented Jacobian is 

$$\mathcal{AJ} \;=\; \begin{pmatrix} u_0/p_0 & u_1/p_1 & u_2/p_2 & u_3/p_3 \\ 1 & 1 & 1 & 1 \\ \partial f_1/\partial p_0 & \partial f_1/\partial p_1 & \partial f_1/\partial p_2 & \partial f_1/\partial p_3 \\ \partial f_2/\partial p_0 & \partial f_2/\partial p_1 & \partial f_2/\partial p_2 & \partial f_2/\partial p_3 \end{pmatrix}. \tag{11.6}$$

Since the codimension of X equals c = 2, we need to enlarge $I_X$ by the determinant of the 
4 x 4 matrix $\mathcal{AJ}$. Clearing denominators amounts to multiplying the ith column of $\mathcal{AJ}$ 
by $p_i$, so the determinant contributes a polynomial of degree $d_1 + d_2 + 1$ to the critical 
equations. The generators of $I_X$ have degrees $d_1$ and $d_2$ respectively. We therefore conclude 
that the ML degree of X equals $d_1 d_2 (d_1 + d_2 + 1)$ by Bézout’s Theorem. ⋄ 


The following general upper bound on the ML degree is established in [90, Theorem 5]. 


Proposition 11.6 Let X be a model of codimension c in the probability simplex $\Delta_n^{\circ}$ whose 
ideal $I_X$ is generated by $k \ge c$ polynomials $f_1, \ldots, f_k$ of degrees $d_1 \ge \cdots \ge d_k$. Then 

$$\mathrm{MLdegree}(X) \;\le\; d_1 d_2 \cdots d_c \cdot \sum_{i_1 + i_2 + \cdots + i_c \,\le\, n-c} d_1^{\,i_1} d_2^{\,i_2} \cdots d_c^{\,i_c}. \tag{11.7}$$

Equality holds when X is a generic complete intersection of codimension c (hence c = k). 
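
The right-hand side of (11.7) is straightforward to evaluate. The following sketch does so by enumerating exponent vectors; `ml_degree_bound` is a hypothetical helper name, and `degrees` is assumed to list the c largest degrees $d_1, \ldots, d_c$.

```python
from itertools import product
from math import prod

def ml_degree_bound(degrees, n):
    """Right-hand side of (11.7) for c = len(degrees) generators in P^n."""
    c = len(degrees)
    # Sum of d_1^{i_1} ... d_c^{i_c} over nonnegative exponents with i_1 + ... + i_c <= n - c.
    weighted_count = sum(
        prod(d ** i for d, i in zip(degrees, exponents))
        for exponents in product(range(n - c + 1), repeat=c)
        if sum(exponents) <= n - c
    )
    return prod(degrees) * weighted_count

# Space curves (n = 3, c = 2) as in Example 11.5, with hypothetical degrees 2 and 3.
bound_space_curve = ml_degree_bound([2, 3], 3)
```

For n = 3 and c = 2 the sum runs over the exponent vectors (0,0), (1,0), (0,1), so the bound reduces to $d_1 d_2 (1 + d_1 + d_2)$, which agrees with Example 11.5.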


The formula in Theorem 11.7 below shows that, under some assumptions, the ML degree 
is equal to a signed Euler characteristic. This is the likelihood analogue to Theorem 2.13, 
which relates the ED degree to polar degrees, and hence to Chern classes (Chapter 4). 

Given our variety X in the complex projective space $\mathbb{P}^n$, we let $X^{\circ}$ be the open subset 
of X that is obtained by removing the hyperplane arrangement $\{\, p_0 p_1 \cdots p_n \,(\sum_{i=0}^{n} p_i) = 0 \,\}$. 

Fig. 11.1: Probability tree that describes the coin toss model in Example 11.8. 


We recall from [99, 100] that a very affine variety is a closed subvariety of an algebraic 
torus $(\mathbb{C}^*)^r$. In our setting, the open set $X^{\circ}$ is a very affine variety with r = n + 2. The 
following formula works for any very affine variety. 


Theorem 11.7 Suppose that the very affine variety $X^{\circ}$ is nonsingular. Then, 

$$\mathrm{MLdegree}(X) \;=\; (-1)^{\dim(X)} \cdot \chi(X^{\circ}),$$

where the latter is the topological Euler characteristic of the manifold $X^{\circ}$. 


Proof (and Discussion) This was proved under additional assumptions by Catanese, 
Hoşten, Khetan, and Sturmfels in [41, Theorem 19], and then in full generality by Huh 
in [99, Theorem 1]. If the very affine variety $X^{\circ}$ is singular, then the Euler characteristic 
can be replaced by the Chern–Schwartz–MacPherson class, as shown in [99, Theorem 2]. □ 


Varieties for which the ML degree is equal to one are of special interest, both in statistics 
and in geometry. For a model X to have ML degree one means that the MLE $\hat p$ is a rational 
function of the data u. This happens for the independence model in Example 11.3. The ML 
degree of the surface $X = V(p_0 p_3 - p_1 p_2)$ is one because the MLE in (11.5) is a rational 
function of the data. To be precise, $\hat p$ is a homogeneous rational function of degree 0 in u. 
Here is another model which has ML degree one. 


Example 11.8 Given a biased coin, we perform the following experiment: Flip the biased 
coin. If it shows heads, flip it again. The outcome of this experiment is the number of 
heads: 0, 1, or 2. This describes a generative statistical model on three states, illustrated 
in Figure 11.1. If s denotes the bias of our coin, then the model is the parametric curve X 
given by $(0,1) \to X \subset \Delta_2^{\circ}$, $s \mapsto (\, s^2,\; s(1-s),\; 1-s \,)$. The underlying variety is the conic 
$X = V(p_0 p_2 - (p_0 + p_1) p_1) \subset \mathbb{P}^2$. Its MLE is given by the formula 

$$\hat p \;=\; (\hat p_0, \hat p_1, \hat p_2) \;=\; \left( \frac{(2u_0+u_1)^2}{(2u_0+2u_1+u_2)^2},\;\; \frac{(2u_0+u_1)(u_1+u_2)}{(2u_0+2u_1+u_2)^2},\;\; \frac{u_1+u_2}{2u_0+2u_1+u_2} \right). \tag{11.8}$$

Since the coordinates of $\hat p$ are rational functions, the ML degree of X is equal to one. ⋄ 
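
One way to arrive at (11.8), sketched here under the parametrization above, is to note that the log-likelihood becomes $(2u_0+u_1)\log(s) + (u_1+u_2)\log(1-s)$, whose derivative vanishes at $\hat s = (2u_0+u_1)/(2u_0+2u_1+u_2)$. The code checks this in exact arithmetic; the counts and the name `coin_mle` are assumptions for illustration.

```python
from fractions import Fraction

def coin_mle(u0, u1, u2):
    """Return the estimated bias s_hat and the point p_hat of (11.8)."""
    s_hat = Fraction(2 * u0 + u1, 2 * u0 + 2 * u1 + u2)
    return s_hat, (s_hat ** 2, s_hat * (1 - s_hat), 1 - s_hat)

u0, u1, u2 = 3, 5, 2                      # hypothetical counts of 0, 1, 2 heads
s_hat, p_hat = coin_mle(u0, u1, u2)

# Derivative of the log-likelihood in s, evaluated at s_hat.
score = (2 * u0 + u1) / s_hat - (u1 + u2) / (1 - s_hat)

# Defining equation of the conic, evaluated at p_hat.
conic = p_hat[0] * p_hat[2] - (p_hat[0] + p_hat[1]) * p_hat[1]
```

Both `score` and `conic` vanish, so the point lies on the model and is critical for the likelihood.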


The following theorem explains what we saw in equations (11.5) and (11.8): 


Theorem 11.9 If $X \subset \Delta_n^{\circ}$ is a model of ML degree one, so that $\hat p$ is a rational function of u, 
then each coordinate $\hat p_i$ is an alternating product of linear forms with positive coefficients. 

Proof (and Discussion) This was proved by Huh [100] in the setting of arbitrary complex 
very affine varieties. It was adapted to real algebraic geometry, and hence to statistical mod- 
els, by Duarte, Marigliano, and Sturmfels [63]. These two articles offer precise statements 
via Horn uniformization for A-discriminants [71]. A-discriminants are hypersurfaces dual 
to toric varieties, as we already saw at the end of Chapter 9. For additional information, we 
refer to the paper by Huh and Sturmfels [101, Corollary 3.12]. □ 


Models given by rank constraints on matrices and tensors are particularly important in 
applications since these represent conditional independence. Consider two random vari- 
ables, having $n_1$ and $n_2$ states respectively, which are conditionally independent given a 
hidden random variable with r states. For us, this model is the determinantal variety $X_r$ 
in $\mathbb{P}^{n_1 n_2 - 1}$ that is defined by the (r+1) x (r+1) minors of an $n_1 \times n_2$ matrix $(p_{ij})$. It appeared 
as the neuromanifold of linear fully connected neural networks in Section 10.1. The ML de- 
gree of this rank r model was first studied by Hauenstein, Rodriguez, and Sturmfels in [82], 
who obtained the following results using methods from numerical algebraic geometry. 


Proposition 11.10 For small values of $n_1$ and $n_2$, the ML degrees of low-rank models $X_r$ 
are presented in the following table: 

    r \ (n1, n2) | (3,3)   (3,4)   (3,5)   (4,4)   (4,5)   (4,6)   (5,5) 
    -------------+------------------------------------------------------- 
         1       |   1       1       1       1       1       1        1 
         2       |  10      26      58     191     843    3119     6776 
         3       |   1       1       1     191     843    3119    61326        (11.9) 
         4       |                           1       1       1     6776 
         5       |                                                    1 


The table in (11.9) can be viewed as a Kullback-Leibler analogue to Corollary 5.20. 
Every entry in the r = 1 row equals 1 because the MLE for the independence model is a 
rational function in the data $(u_{ij})$. One finds $\hat p = (\hat p_{ij})$ by multiplying the column vector of 
row sums of u with the row vector of column sums of u, and then dividing by $|u|^2$, as shown 
in (11.5). The other entries are more interesting, and they give precise information on the 
algebraic complexity of minimizing the Kullback-Leibler divergence from a given data 
matrix u to the conditional independence model $X_r$. Here is an example taken from [82]. 


Example 11.11 We fix $n_1 = n_2 = 5$. Following [82, Example 7], we consider the data 

$$u \;=\; \begin{pmatrix} 2864 & 6 & 6 & 3 & 3 \\ 2 & 7577 & 2 & 2 & 5 \\ 4 & 1 & 7543 & 2 & 4 \\ 5 & 1 & 2 & 3809 & 4 \\ 6 & 2 & 6 & 3 & 5685 \end{pmatrix}.$$

For r = 2 and r = 4, this instance of our ML estimation problem has the expected number 
of 6776 distinct complex critical points. In both cases, 1774 of these are real and 90 of 
these are real and positive. This illustrates the last statement in Theorem 11.12 below. The 
number of local maxima for r = 2 equals 15, and the number of local maxima for r = 4 
equals 6. For r = 3, we have 61326 critical points, of which 15450 are real. Of these, 362 
are positive and 25 are local maxima. We invite our readers to critically check these claims, 
by running software for solving polynomial equations, as explained in Chapter 3. ⋄ 


The columns of the table in (11.9) exhibit an obvious symmetry. This was conjectured 
in [82], and it was proved by Draisma and Rodriguez in their article [61] on maximum 
likelihood duality. We now state their result. Given an $n_1 \times n_2$ matrix u, we write $\Omega_u$ for 
the matrix whose (i, j) entry equals 

$$\frac{u_{ij}\, u_{i+}\, u_{+j}}{(u_{++})^3}.$$

Here, we use the following notation for the row sums, the column sums, and the sample size: 

$$u_{i+} = \sum_{j=1}^{n_2} u_{ij}, \qquad u_{+j} = \sum_{i=1}^{n_1} u_{ij}, \qquad\text{and}\qquad u_{++} = \sum_{i=1}^{n_1} \sum_{j=1}^{n_2} u_{ij}.$$


In the following theorem, the symbol $*$ denotes the Hadamard product (or entrywise 
product) of two matrices. That is, if $p = (p_{ij})$ and $q = (q_{ij})$, then $p * q = (p_{ij} \cdot q_{ij})$. 


Theorem 11.12 Fix $n_1 \le n_2$ and u an $n_1 \times n_2$ matrix with strictly positive integer entries. 
There exists a bijection between the complex critical points $p_1, p_2, \ldots, p_s$ of the likelihood 
function for u on $X_r$ and the complex critical points $q_1, q_2, \ldots, q_s$ on $X_{n_1 - r + 1}$ such that 

$$p_1 * q_1 \;=\; p_2 * q_2 \;=\; \cdots \;=\; p_s * q_s \;=\; \Omega_u. \tag{11.10}$$

In particular, this bijection preserves reality, positivity, and rationality of the critical points. 
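
The simplest instance of (11.10) pairs r = 1 with $r = n_1$, where each side has a single critical point. For r = 1 this is the independence MLE, and for $r = n_1$ the model is the whole simplex, whose MLE is the empirical distribution. The sketch below checks the Hadamard identity exactly for the data of Example 11.11. The function names are assumptions, and the check says nothing about the harder cases 1 < r < $n_1$.

```python
from fractions import Fraction

# Data matrix from Example 11.11.
u = [[2864, 6, 6, 3, 3],
     [2, 7577, 2, 2, 5],
     [4, 1, 7543, 2, 4],
     [5, 1, 2, 3809, 4],
     [6, 2, 6, 3, 5685]]

row = [sum(r) for r in u]              # row sums u_{i+}
col = [sum(c) for c in zip(*u)]        # column sums u_{+j}
total = sum(row)                       # sample size u_{++}
n1, n2 = len(u), len(u[0])

# Entries u_ij * u_{i+} * u_{+j} / u_{++}^3 of the matrix Omega_u.
omega = [[Fraction(u[i][j] * row[i] * col[j], total ** 3) for j in range(n2)] for i in range(n1)]

# MLE on the rank-one model X_1 and on the full model X_{n1}.
p_rank_one = [[Fraction(row[i] * col[j], total ** 2) for j in range(n2)] for i in range(n1)]
q_full = [[Fraction(u[i][j], total) for j in range(n2)] for i in range(n1)]

hadamard = [[p_rank_one[i][j] * q_full[i][j] for j in range(n2)] for i in range(n1)]
duality_holds = hadamard == omega
```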


This result is a multiplicative version of Theorem 2.23 on duality for ED degrees. By 
“multiplicative” we mean that $u_i/p_i$ instead of $u_i - p_i$ appears in the first row of the 
augmented Jacobian matrix. Theorem 11.12 concerns the case of determinantal varieties. 
In the ED theory, these led us to the Eckart-Young Theorem. 

It is a challenge in intersection theory and singularity theory to find general formulas 
for the ML degrees in Proposition 11.10. This problem was solved for r = 2 by Rodriguez 
and Wang in [154]. They give a recursive formula in [154, Theorem 4.1], and they present 
impressive values in [154, Table 1]. They unravel the recursion, and they obtain the ex- 
plicit formulas for the ML degree of conditional independence models in many cases. In 
particular, they obtain the following result, which had been stated as a conjecture in [82]. 


Theorem 11.13 (Rodriguez–Wang [154]) Consider the variety $X_2 \subset \mathbb{P}^{3n-1}$ whose points 
are the 3 x n matrices of rank at most two. The ML degree of this variety is 

$$\mathrm{MLdegree}(X_2) \;=\; 2^{n+1} - 6.$$


11.3 Scattering Equations 


We now turn to a connection between algebraic statistics and particle physics that was 
developed in [165]. The context is scattering amplitudes, where the critical equations 
for (11.2)–(11.3) are the scattering equations. We consider the CEGM model, due to 
Cachazo and his collaborators [37,38]. The role of the data vector u is played in physics 
by the Mandelstam invariants. This theory rests on the space $X^{\circ}$ of m labeled points in 
general position in $\mathbb{P}^{k-1}$, up to projective transformations. 

Consider the Grassmannian $\mathrm{Gr}(k-1, \mathbb{P}^{m-1})$ in its Plücker embedding into $\mathbb{P}^{\binom{m}{k}-1}$. The 
torus $(\mathbb{C}^*)^m$ acts on this by scaling the columns of k x m matrices representing subspaces. 
Let $\mathrm{Gr}(k-1, \mathbb{P}^{m-1})^{\circ}$ be the open Grassmannian where all Plücker coordinates are non-zero. 


Definition 11.14 The CEGM model is the (k − 1)(m − k − 1)-dimensional manifold 

$$X^{\circ} \;=\; \mathrm{Gr}(k-1, \mathbb{P}^{m-1})^{\circ} / (\mathbb{C}^*)^m. \tag{11.11}$$


Example 11.15 (k = 2) For k = 2, the very affine variety in (11.11) has dimension m − 3, 
and it is the moduli space of m distinct labeled points on the complex projective line $\mathbb{P}^1$. This 
space is ubiquitous in algebraic geometry, where it is known as $\mathcal{M}_{0,m}$. The punchline of our 
discussion here is that we interpret the moduli space $\mathcal{M}_{0,m}$ as a statistical model. We 
then argue that its ML degree is equal to (m − 3)!. For instance, if m = 4 then $X^{\circ} = \mathcal{M}_{0,4}$ 
is the Riemann sphere $\mathbb{P}^1$ with three points removed. The signed Euler characteristic of this 
surface is one, and Theorem 11.9 applies. ⋄ 


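Proposition 11.16 The configuration space $X^{\circ}$ in (11.11) is a very affine variety, with 
coordinates given by the k x k minors of the k x m matrix 

$$M_{k,m} \;=\; [\, A \;\; B \,],$$

where the matrices $A \in \mathbb{R}^{k \times k}$ and $B \in \mathbb{R}^{k \times (m-k)}$ are given by 

$$A = \begin{pmatrix} 0 & 0 & 0 & \cdots & 0 & (-1)^k \\ 0 & 0 & 0 & \cdots & (-1)^{k-1} & 0 \\ \vdots & \vdots & \vdots & & \vdots & \vdots \\ 0 & 0 & -1 & \cdots & 0 & 0 \\ 0 & 1 & 0 & \cdots & 0 & 0 \\ -1 & 0 & 0 & \cdots & 0 & 0 \end{pmatrix} \quad\text{and}\quad B = \begin{pmatrix} 1 & 1 & 1 & \cdots & 1 \\ 1 & x_{1,1} & x_{1,2} & \cdots & x_{1,m-k-1} \\ \vdots & \vdots & \vdots & & \vdots \\ 1 & x_{k-3,1} & x_{k-3,2} & \cdots & x_{k-3,m-k-1} \\ 1 & x_{k-2,1} & x_{k-2,2} & \cdots & x_{k-2,m-k-1} \\ 1 & x_{k-1,1} & x_{k-1,2} & \cdots & x_{k-1,m-k-1} \end{pmatrix}.$$

We denote by $p_{i_1 i_2 \cdots i_k}$ the k x k minor of $M_{k,m}$ with columns $i_1 < i_2 < \cdots < i_k$. 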

Following [2, Equation (4)], the antidiagonal matrix in the left k x k block of $M_{k,m}$ 
is chosen so that each unknown $x_{i,j}$ is precisely equal to the minor $p_{i_1 i_2 \cdots i_k}$ for some 
$i_1 < i_2 < \cdots < i_k$. No signs are needed. 


Definition 11.17 The scattering potential for the CEGM model is the function 

$$\ell_u \;=\; \sum_{i_1 i_2 \cdots i_k} u_{i_1 i_2 \cdots i_k} \cdot \log(p_{i_1 i_2 \cdots i_k}). \tag{11.12}$$

This is a multi-valued function on the very affine complex variety $X^{\circ}$. Here, $u = (u_{i_1 i_2 \cdots i_k})$ 
is the data vector (its coordinates are called Mandelstam invariants in physics) and the 
$p_{i_1 i_2 \cdots i_k}$ are the coordinates of the open Grassmannian in the Plücker embedding. 


The critical point equations, known as scattering equations [2, equation (7)], are given by 

$$\frac{\partial \ell_u}{\partial x_{i,j}} \;=\; 0 \qquad \text{for } 1 \le i \le k-1 \text{ and } 1 \le j \le m-k-1. \tag{11.13}$$

These are equations of rational functions. Solving these equations is the agenda in the 
articles [37,38, 165]. 


Corollary 11.18 The number of complex solutions to (11.13) is the ML degree of the CEGM 
model $X^{\circ}$. This number equals the signed Euler characteristic $(-1)^{(k-1)(m-k-1)} \cdot \chi(X^{\circ})$. 


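Example 11.19 (k = 2, m = 6) The very affine threefold $X^{\circ} = \mathcal{M}_{0,6}$ sits in $(\mathbb{C}^*)^9$ via 

$$\begin{aligned} p_{24} &= x_1, & p_{25} &= x_2, & p_{26} &= x_3, & p_{34} &= x_1 - 1, & p_{35} &= x_2 - 1, \\ p_{36} &= x_3 - 1, & p_{45} &= x_2 - x_1, & p_{46} &= x_3 - x_1, & p_{56} &= x_3 - x_2. \end{aligned}$$

These nine coordinates on $X^{\circ} \subset (\mathbb{C}^*)^9$ are the non-constant 2 x 2 minors of our matrix 

$$M_{2,6} \;=\; \begin{pmatrix} 0 & 1 & 1 & 1 & 1 & 1 \\ -1 & 0 & 1 & x_1 & x_2 & x_3 \end{pmatrix}.$$

The scattering potential is the analogue to the log-likelihood function in statistics: 

$$\ell_u \;=\; u_{24} \log(p_{24}) + u_{25} \log(p_{25}) + \cdots + u_{56} \log(p_{56}).$$

This function has six critical points in $X^{\circ}$. Hence, $\mathrm{MLdegree}(X^{\circ}) = -\chi(X^{\circ}) = 6$. ⋄ 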
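
The nine coordinates in Example 11.19 can be confirmed by evaluating all fifteen 2 x 2 minors of $M_{2,6}$ at sample values. In this sketch the two evaluation points and the function names are assumptions; the remaining six minors turn out to be constant.

```python
from fractions import Fraction
from itertools import combinations

def m26(x1, x2, x3):
    """The 2 x 6 matrix M_{2,6} of Example 11.19."""
    return [[0, 1, 1, 1, 1, 1],
            [-1, 0, 1, x1, x2, x3]]

def minors(matrix):
    """All 2 x 2 minors p_ij, keyed by 1-based column labels i < j."""
    top, bottom = matrix
    return {(i + 1, j + 1): top[i] * bottom[j] - top[j] * bottom[i]
            for i, j in combinations(range(len(top)), 2)}

x1, x2, x3 = Fraction(2), Fraction(5), Fraction(11)       # hypothetical values
p = minors(m26(x1, x2, x3))
p_other = minors(m26(Fraction(3), Fraction(7), Fraction(20)))

# Minors that change between the two evaluation points depend on x1, x2, x3.
non_constant = sorted(key for key in p if p[key] != p_other[key])
```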
We now examine the number of critical points of the scattering potential (11.12). 


Theorem 11.20 The known values of the ML degree for the CEGM model (11.11) are as 
follows. For k = 2, the ML degree equals (m − 3)! for all $m \ge 4$. For k = 3, the ML degree 
equals 2, 26, 1272, 188112, 74570400 when the number of points is m = 5, 6, 7, 8, 9. For 
k = 4 and m = 8, the ML degree equals 5211816. 


Proof We refer to [2, Example 2.2], [2, Theorem 5.1], and [2, Theorem 6.1] for k = 2, 3, 4. □ 

Knowing these ML degrees for the CEGM model helps in solving the scattering equa- 
tions reliably. It was demonstrated in [2, 165] how this can be done in practice with the 
software HomotopyContinuation.jl [31]. For instance, [165, Table 1] discusses the 
computation of the 10! = 3628800 critical points for k = 2 and m = 13. See [2, Section 6] 
for the solution in the challenging case k = 4 and m = 8. 

One purpose of this short section was to demonstrate that ML degrees of very affine 
varieties X° appear in many scenarios, notably in physics, well beyond statistical models. 
By connecting these scenarios to algebraic statistics, both sides benefit. Metric algebraic 
geometry offers a framework for developing such connections. See also [7] for connecting 
ML estimation with norm minimization over a group orbit in invariant theory. 


11.4 Gaussian Models 


We now change the topic by turning to statistical models for Gaussian random variables. 
Let $\mathrm{PD}_n$ denote the set of positive-definite symmetric $n \times n$ matrices, i.e. matrices all of 
whose eigenvalues are positive. This is an open convex cone in the real vector space $S^2(\mathbb{R}^n)$ 
of symmetric $n \times n$ matrices, which has dimension $\binom{n+1}{2}$. This cone now plays the role 
which was played by the simplex $\Delta_n^{\circ}$ when we discussed discrete models. 

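Given a mean vector $\mu \in \mathbb{R}^n$ and a covariance matrix $\Sigma \in \mathrm{PD}_n$, the associated Gaussian 
distribution is supported on $\mathbb{R}^n$. Its density has the familiar “bell shape”; it is the function 

$$f_{\mu,\Sigma}(x) \;:=\; \frac{1}{\sqrt{(2\pi)^n \det(\Sigma)}} \cdot \exp\Bigl( -\tfrac{1}{2}\,(x - \mu)^{\top} \Sigma^{-1} (x - \mu) \Bigr).$$

We fix a model $Y \subset \mathbb{R}^n \times \mathrm{PD}_n$ defined by polynomial equations in $(\mu, \Sigma)$. Suppose we 
are given N samples $U^{(1)}, \ldots, U^{(N)}$. These samples are vectors in $\mathbb{R}^n$. These data are 
summarized in the sample mean $\bar U = \frac{1}{N} \sum_{i=1}^{N} U^{(i)}$ and in the sample covariance matrix 

$$S \;=\; \frac{1}{N} \sum_{i=1}^{N} (U^{(i)} - \bar U)(U^{(i)} - \bar U)^{\top}.$$

With this data matrix, the log-likelihood is, up to an additive constant, the following 
function in the unknowns $(\mu, \Sigma)$: 

$$\ell(\mu, \Sigma) \;=\; -\frac{N}{2} \cdot \Bigl[ \log\det(\Sigma) + \mathrm{trace}(S \Sigma^{-1}) + (\bar U - \mu)^{\top} \Sigma^{-1} (\bar U - \mu) \Bigr]. \tag{11.14}$$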


The task of likelihood inference is to maximize this function subject to $(\mu, \Sigma) \in Y$. 

There are two extreme cases. First, consider a model where $\Sigma$ is fixed to be the identity 
matrix $I_n$. Then $Y = X \times \{I_n\}$ and we are supposed to minimize the Euclidean distance 
from the sample mean $\bar U$ to the variety X in $\mathbb{R}^n$. This is precisely the earlier ED problem. 

We instead focus on the second case, when $Y = \mathbb{R}^n \times X$ for some subvariety X of 
$S^2(\mathbb{R}^n)$. Then, any maximum $(\mu, \Sigma) \in Y$ of (11.14) satisfies $\mu = \bar U$, and so the problem is 
reduced to estimating the covariance matrix $\Sigma \in X$. Hence, by shifting all samples until $\bar U$ 
becomes 0, we may assume that our model has the form $Y = \{0\} \times X$ and our task is 

    Minimize the function $\Sigma \mapsto \log\det(\Sigma) + \mathrm{trace}(S \Sigma^{-1})$ subject to $\Sigma \in X$.    (11.15) 

Using the concentration matrix $K = \Sigma^{-1}$, we can write this equivalently as follows: 

    Maximize the function $K \mapsto \log\det(K) - \mathrm{trace}(S K)$ subject to $K \in X^{-1}$.    (11.16) 

Here, the variety $X^{-1}$ is the Zariski closure of the set of inverses of all matrices in X. 


Remark 11.21 The optimization problem (11.15)-(11.16) has a metric interpretation as 
in (11.3). Namely, we can define the KL divergence between two continuous probability 
distributions on $\mathbb{R}^n$ by replacing the sum in (11.1) with the corresponding integral over $\mathbb{R}^n$. 
That way, we obtain a certain kind of distance between the unknown matrix $\Sigma$ and the given 
sample covariance matrix S. 


The critical equations for (11.15)-(11.16) can be written as polynomials since the partial 
derivatives of the logarithm are rational functions. These equations have finitely many 
complex solutions. Their number is the ML degree of the statistical model $X^{-1}$. 

In the remainder of this section, we focus on Gaussian statistical models that are de- 
scribed by linear constraints on either the covariance matrix or its inverse, which is the 
concentration matrix. We consider a linear space of symmetric matrices (LSSM), 

$$\mathcal{L} \;\subset\; S^2(\mathbb{R}^n),$$

whose general element is assumed to be invertible. We are interested in the models $X^{-1} = \mathcal{L}$ 
and $X = \mathcal{L}$. It is convenient to use primal-dual coordinates $(\Sigma, K)$ to write the respective 
critical equations. For a proof of the next result, see [166, Propositions 3.1 and 3.3]. 


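Proposition 11.22 Fix an LSSM $\mathcal{L}$ and its orthogonal complement $\mathcal{L}^\perp$ under the Euclidean
inner product $\langle X, Y \rangle = \operatorname{Trace}(X^\top Y)$. The critical equations for the linear concentration
model $X^{-1} = \mathcal{L}$ are

\[ K \in \mathcal{L} \quad\text{and}\quad K\Sigma = I_n \quad\text{and}\quad \Sigma - S \in \mathcal{L}^\perp. \tag{11.17} \]

The critical equations for the linear covariance model $X = \mathcal{L}$ are

\[ \Sigma \in \mathcal{L} \quad\text{and}\quad K\Sigma = I_n \quad\text{and}\quad KSK - K \in \mathcal{L}^\perp. \tag{11.18} \]

The system (11.17) is linear in the unknown matrix $K$, whereas the last group of equations
in (11.18) is quadratic in $K$. The numbers of complex solutions are the ML degree of $\mathcal{L}$
and the reciprocal ML degree of $\mathcal{L}$. The former is smaller than the latter.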
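
To make (11.17) concrete, here is a minimal sketch for one very special LSSM, namely the space of diagonal matrices. This choice, the matrix `S`, and all helper names are our own illustrative assumptions and do not come from the text. In this case the orthogonal complement consists of the symmetric matrices with zero diagonal, so the critical point keeps the diagonal of $S$. The code checks the three groups of equations in exact rational arithmetic.

```python
from fractions import Fraction

# Hypothetical 3x3 sample covariance matrix (symmetric, positive definite).
S = [[Fraction(4), Fraction(1), Fraction(0)],
     [Fraction(1), Fraction(3), Fraction(1)],
     [Fraction(0), Fraction(1), Fraction(2)]]
n = len(S)

# Candidate critical point for the diagonal concentration model:
# Sigma keeps the diagonal of S, and K is the inverse of Sigma.
Sigma = [[S[i][j] if i == j else Fraction(0) for j in range(n)] for i in range(n)]
K = [[1 / S[i][i] if i == j else Fraction(0) for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def is_diagonal(M):
    return all(M[i][j] == 0 for i in range(n) for j in range(n) if i != j)

def has_zero_diagonal(M):
    return all(M[i][i] == 0 for i in range(n))

identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
diff = [[Sigma[i][j] - S[i][j] for j in range(n)] for i in range(n)]

k_in_L = is_diagonal(K)                      # K lies in L
inverse_ok = matmul(K, Sigma) == identity    # K * Sigma = I_n
diff_in_L_perp = has_zero_diagonal(diff)     # Sigma - S lies in L^perp
print(k_in_L, inverse_ok, diff_in_L_perp)
```

The diagonal space is far from generic, so this sketch says nothing about the ML degrees of generic spaces; it only illustrates how the three conditions fit together.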


Example 11.23 Let $n = 4$ and let $\mathcal{L}$ be a generic LSSM of dimension $k$. The degrees are:

k = dim(L)           :   2   3   4   5   6   7   8   9
ML degree            :   3   9  17  21  21  17   9   3
reciprocal ML degree :   5  19  45  71  81  63  29   7

These numbers and many more appear in [166, Table 1]. ⋄

ML degrees and reciprocal ML degrees of linear spaces of symmetric matrices have
been studied intensively in the recent literature, both for generic and special spaces $\mathcal{L}$.
See [6, 22, 67] and the references therein. We now present an important result due to
Manivel, Michałek, Monin, Seynnaeve, Vodička, and Wiśniewski. Theorem 11.24 paraphrases
highlights from their articles [126, 132].


Theorem 11.24 The ML degree of a generic linear subspace $\mathcal{L}$ of dimension $k$ in $S^2(\mathbb{R}^n)$
is the number of quadrics in $\mathbb{P}^{n-1}$ that pass through $\binom{n+1}{2} - k$ general points and are tangent
to $k - 1$ general hyperplanes. For fixed $k$, this number is a polynomial in $n$ of degree $k - 1$.


Proof The first statement is [132, Corollary 2.6 (4)], here interpreted classically in terms
of Schubert calculus. For a detailed discussion, see the introduction of the article [126].
The second statement appears in [126, Theorem 1.3 and Corollary 4.13]. □


Example 11.25 Let $n = 4$. Fix $10 - k$ points and $k - 1$ planes in $\mathbb{P}^3$. We are interested in all
quadratic surfaces that contain the points and are tangent to the planes. These points and
planes impose 9 constraints on $\mathbb{P}(S^2(\mathbb{C}^4)) \simeq \mathbb{P}^9$. Passing through a point imposes a linear
equation. Being tangent to a plane is a cubic constraint on $\mathbb{P}^9$. Bézout's Theorem suggests
that there could be $3^{k-1}$ solutions. This is correct for $k \le 3$, but it overcounts for $k \ge 4$.
Indeed, in Example 11.23, we see $17, 21, 21, \ldots$ instead of $27, 81, 243, \ldots$ ⋄
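
The comparison in Example 11.25 can be tabulated in a few lines. The following sketch copies the ML degrees from Example 11.23 and sets them against the Bézout count $3^{k-1}$; the variable names are ours.

```python
# ML degrees for n = 4, copied from Example 11.23 (k = 2, ..., 9).
ml_degree = {2: 3, 3: 9, 4: 17, 5: 21, 6: 21, 7: 17, 8: 9, 9: 3}

for k, deg in ml_degree.items():
    bezout = 3 ** (k - 1)   # k - 1 cubic tangency conditions, the other conditions are linear
    status = "tight" if deg == bezout else "overcount"
    print(k, deg, bezout, status)

# Values of k for which the Bezout count agrees with the ML degree.
tight = [k for k, deg in ml_degree.items() if deg == 3 ** (k - 1)]
```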


The intersection theory approach in [126, 132] leads to formulas for the ML degrees 
of linear Gaussian models. From this, we obtain provably correct numerical methods for 
maximum likelihood estimation, based on homotopy continuation. Namely, after computing 
all critical points as in [166], we can certify them with interval arithmetic as in [30]. Since 
the ML degree is known beforehand, one can be sure that all solutions have been found.


Chapter 12
Tensors


Tensors are generalizations of matrices. They can be viewed as tables of higher dimension.
A $2 \times 2$ matrix is a table that contains four numbers aligned in two directions, and each
direction has dimension 2. Similarly, a $2 \times 2 \times 2$ tensor is a table with eight numbers aligned
in three directions where each direction has dimension 2.


Fig. 12.1: A $2 \times 2$ matrix $A$ and a $2 \times 2 \times 2$ tensor $B$.


In this chapter, we offer an introduction to tensors, with a view toward metric algebraic 
geometry. We present the notion of tensor rank, along with the Segre variety and the 
Veronese variety. These arise from tensors of bounded rank. We then turn to the spectral 
theory of tensors, and we show that eigenvectors and singular vectors arise naturally in the 
study of Euclidean distance problems. In the last section, we study volumes in complex 
projective space, and we determine these volumes for Segre and Veronese varieties. 


12.1 Tensors and their Rank 


For positive integers $n_1, \ldots, n_d$, the vector space of real $n_1 \times \cdots \times n_d$ tensors is denoted by

\[ \mathbb{R}^{n_1 \times \cdots \times n_d} := \bigl\{ A = (a_{i_1,\ldots,i_d})_{1 \le i_1 \le n_1, \ldots, 1 \le i_d \le n_d} \mid a_{i_1,\ldots,i_d} \in \mathbb{R} \bigr\}. \]

In the literature, elements in the vector space $\mathbb{R}^{n_1 \times \cdots \times n_d}$ are also called hypermatrices or
Cartesian tensors, because our definition of a tensor as a multidimensional array relies on a
basis. The dimension of the vector space $\mathbb{R}^{n_1 \times \cdots \times n_d}$ is $n_1 \cdots n_d$. We also consider the space
of complex tensors $\mathbb{C}^{n_1 \times \cdots \times n_d}$. The number $d$ is called the order of the tensor. Tensors of
order one are vectors. Tensors of order two are matrices.

A common way to write an order-three tensor is as a tuple of matrices.


Example 12.1 The tensor $B = (b_{ijk})_{1 \le i,j,k \le 2}$ from Figure 12.1 can be written as

\[ B = \left[ \begin{array}{cc|cc} 1 & 2 & 5 & 6 \\ 3 & 4 & 7 & 8 \end{array} \right] \in \mathbb{R}^{2 \times 2 \times 2}. \]

The left matrix contains the entries $b_{ij1}$ and the matrix on the right contains the $b_{ij2}$. ⋄


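The Euclidean inner product of two tensors $A = (a_{i_1,\ldots,i_d})$ and $B = (b_{i_1,\ldots,i_d})$ is defined as

\[ \langle A, B \rangle := \sum_{i_1=1}^{n_1} \cdots \sum_{i_d=1}^{n_d} a_{i_1,\ldots,i_d} \, b_{i_1,\ldots,i_d}. \]

The resulting norm on $\mathbb{R}^{n_1 \times \cdots \times n_d}$ is given by the familiar formula $\|A\| = \sqrt{\langle A, A \rangle}$.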

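One way to create a tensor is to take the outer product of $d$ vectors $v_i \in \mathbb{R}^{n_i}$, where
$1 \le i \le d$. This tensor is defined by taking all $n_1 n_2 \cdots n_d$ possible products between the
entries of distinct vectors:

\[ v_1 \otimes \cdots \otimes v_d := \bigl( (v_1)_{i_1} \cdots (v_d)_{i_d} \bigr)_{1 \le i_1 \le n_1, \ldots, 1 \le i_d \le n_d}. \tag{12.1} \]

The outer products of the standard basis vectors $e_{i_j} \in \mathbb{R}^{n_j}$ give the standard basis

\[ \{ e_{i_1} \otimes \cdots \otimes e_{i_d} \mid 1 \le i_j \le n_j \text{ for } j = 1, \ldots, d \} \]

of $\mathbb{R}^{n_1 \times \cdots \times n_d}$. In particular, we have $(a_{i_1,\ldots,i_d}) = \sum_{i_1=1}^{n_1} \cdots \sum_{i_d=1}^{n_d} a_{i_1,\ldots,i_d} \, e_{i_1} \otimes \cdots \otimes e_{i_d}$.
If $n := n_1 = \cdots = n_d$ are all equal, then we write the tensor space as $(\mathbb{R}^n)^{\otimes d} := \mathbb{R}^{n \times \cdots \times n}$.


Each permutation $\pi \in \mathfrak{S}_d$ acts on outer products in $(\mathbb{R}^n)^{\otimes d}$ by permuting the factors:
$\pi(v_1 \otimes \cdots \otimes v_d) = v_{\pi(1)} \otimes \cdots \otimes v_{\pi(d)}$. This yields an action of the symmetric group $\mathfrak{S}_d$
on $(\mathbb{R}^n)^{\otimes d}$ via linear extension. For $d = 2$, this is transposition of matrices.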


Definition 12.2 A tensor $A \in (\mathbb{R}^n)^{\otimes d}$ is called symmetric if $\pi(A) = A$ for all $\pi \in \mathfrak{S}_d$. We
use the following notation for the vector space of symmetric tensors:

\[ S^d(\mathbb{R}^n) := \{ A \in (\mathbb{R}^n)^{\otimes d} \mid A \text{ is symmetric} \}. \]

Being a symmetric tensor is a linear condition. Hence, the space $S^d(\mathbb{R}^n)$ of symmetric
tensors is a linear subspace of $(\mathbb{R}^n)^{\otimes d}$. For $v \in \mathbb{R}^n$, we write the $d$-fold outer product as

\[ v^{\otimes d} := v \otimes \cdots \otimes v \in S^d(\mathbb{R}^n). \]

Every symmetric tensor $A \in S^d(\mathbb{R}^n)$ determines a homogeneous polynomial of degree $d$:

\[ F_A(x) := \sum_{i_1=1}^{n} \sum_{i_2=1}^{n} \cdots \sum_{i_d=1}^{n} a_{i_1,i_2,\ldots,i_d} \, x_{i_1} x_{i_2} \cdots x_{i_d}. \tag{12.2} \]

This is a polynomial in $n$ variables $x_1, \ldots, x_n$. The map $A \mapsto F_A$ is an isomorphism between
$S^d(\mathbb{R}^n)$ and the vector space of homogeneous polynomials of degree $d$ in $n$ variables. Their
common dimension is $\binom{n+d-1}{d}$. We can write the polynomial in (12.2) as

\[ F_A(x) = \langle A, x^{\otimes d} \rangle. \]

For $A \in S^d(\mathbb{R}^n)$, we have $\|A\| = \|F_A\|_{\mathrm{BW}}$. This is the Bombieri-Weyl metric (Chapter 9).


Example 12.3 Symmetric tensors of order three are in bijection with homogeneous cubics. 
For instance, symmetric 2 x 2 x 2 tensors correspond to binary cubics: 


For A=[ 47 |Y} ]es (R, wehave F4(x) =x} - 3xix + 2x3. 
This generalizes the familiar bijection between symmetric matrices and quadratic forms. 


Given a tuple of matrices $(M_1, \ldots, M_d) \in \mathbb{R}^{k_1 \times n_1} \times \cdots \times \mathbb{R}^{k_d \times n_d}$, we define the multilinear
multiplication by letting each matrix act on one side. For outer products, we have

\[ (M_1, \ldots, M_d).(v_1 \otimes \cdots \otimes v_d) := (M_1 v_1) \otimes \cdots \otimes (M_d v_d). \]

This action extends by linearity to all of $\mathbb{R}^{n_1 \times \cdots \times n_d}$. For $k_1 = n_1, \ldots, k_d = n_d$, this induces
a representation of $\mathrm{GL}(n_1) \times \cdots \times \mathrm{GL}(n_d)$ into the general linear group of $\mathbb{R}^{n_1 \times \cdots \times n_d}$. If
$d = 2$, then $(M_1, M_2).A = M_1 A M_2^\top$. Thus, multilinear multiplication is a generalization of
simultaneous left-right multiplication for matrices.
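
The following sketch illustrates multilinear multiplication on tensors stored as dictionaries from index tuples to entries, and it compares the case $d = 2$ with the ordinary matrix product $M_1 A M_2^\top$. The storage format, the function names, and the small integer matrices are our own assumptions for illustration.

```python
from itertools import product

def multilinear_multiply(matrices, A, shape):
    """Apply (M_1, ..., M_d) to a tensor A stored as {index tuple: entry}.

    `shape` is (n_1, ..., n_d); the matrix M_i has n_i columns and k_i rows.
    """
    new_shape = [len(M) for M in matrices]
    result = {}
    for out_idx in product(*[range(k) for k in new_shape]):
        total = 0
        for in_idx in product(*[range(n) for n in shape]):
            coeff = A[in_idx]
            for M, o, i in zip(matrices, out_idx, in_idx):
                coeff *= M[o][i]
            total += coeff
        result[out_idx] = total
    return result

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

def transpose(X):
    return [list(row) for row in zip(*X)]

# Hypothetical data for the matrix case d = 2.
A_mat = [[1, 2], [3, 4]]
A = {(i, j): A_mat[i][j] for i in range(2) for j in range(2)}
M1 = [[1, 1], [0, 2]]
M2 = [[2, 0], [1, 1], [0, 3]]          # a 3 x 2 matrix, so k_2 = 3

via_tensor = multilinear_multiply([M1, M2], A, (2, 2))
via_matrices = matmul(matmul(M1, A_mat), transpose(M2))
agree = all(via_tensor[(p, q)] == via_matrices[p][q]
            for p in range(2) for q in range(3))
print(agree)
```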

Rank-one tensors are a central topic of this chapter. A tensor $A$ has rank one if it has
the form (12.1). For $d = 2$, this gives us $n_1 \times n_2$ matrices of rank one. The inner product
between rank-one tensors is

\[ \langle v_1 \otimes \cdots \otimes v_d, \, w_1 \otimes \cdots \otimes w_d \rangle = \langle v_1, w_1 \rangle \cdots \langle v_d, w_d \rangle. \tag{12.3} \]
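
As a quick check of (12.3), the sketch below builds two rank-one tensors as in (12.1) and compares their entrywise inner product with the product of the inner products of the factors. The vectors are hypothetical integer data, and the dictionary-based storage is our own choice.

```python
from itertools import product

def outer(*vectors):
    """Rank-one tensor v_1 (x) ... (x) v_d as a dict {index tuple: entry}, cf. (12.1)."""
    tensor = {}
    for idx in product(*[range(len(v)) for v in vectors]):
        entry = 1
        for v, i in zip(vectors, idx):
            entry *= v[i]
        tensor[idx] = entry
    return tensor

def inner(A, B):
    """Euclidean inner product: sum of entrywise products."""
    return sum(A[idx] * B[idx] for idx in A)

def dot(v, w):
    return sum(a * b for a, b in zip(v, w))

# Hypothetical vectors of formats 2, 3, 2.
v = ([1, 2], [0, 1, -1], [3, 1])
w = ([2, 1], [1, 1, 2], [1, 4])

lhs = inner(outer(*v), outer(*w))                               # left side of (12.3)
rhs = dot(v[0], w[0]) * dot(v[1], w[1]) * dot(v[2], w[2])       # right side of (12.3)
print(lhs, rhs)
```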


Definition 12.4 The (real) Segre variety $\mathcal{S}_{n_1,\ldots,n_d}$ consists of all $n_1 \times \cdots \times n_d$ tensors of
rank one, i.e.

\[ \mathcal{S}_{n_1,\ldots,n_d} := \{ v_1 \otimes \cdots \otimes v_d \mid v_i \in \mathbb{R}^{n_i}, \ 1 \le i \le d \}. \]

We simply write $\mathcal{S}$ when $n_1, \ldots, n_d$ are clear from the context. The most immediate
way to see that $\mathcal{S}$ is an algebraic variety goes as follows. We can always flatten a tensor
$A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ into $d$ matrices $F_1, \ldots, F_d$, where $F_i \in \mathbb{R}^{n_i \times (\prod_{j \ne i} n_j)}$. Then, we have $A \in \mathcal{S}$
if and only if the column span of $F_i$ has dimension at most one for $1 \le i \le d$. This is
equivalent to saying that the $F_i$ are all of rank at most one; i.e., their $2 \times 2$ minors vanish.
The dimension of the Segre variety is $\dim \mathcal{S} = n_1 + \cdots + n_d + 1 - d$.
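
The flattening criterion translates directly into a membership test. The sketch below forms the $d$ flattenings of a tensor and checks that all of their $2 \times 2$ minors vanish. The dictionary storage, the function names, and the two test tensors are illustrative assumptions of ours; exact integer arithmetic is used so that the vanishing test is meaningful.

```python
from itertools import product, combinations

def outer(*vectors):
    tensor = {}
    for idx in product(*[range(len(v)) for v in vectors]):
        entry = 1
        for v, i in zip(vectors, idx):
            entry *= v[i]
        tensor[idx] = entry
    return tensor

def flattening(A, shape, mode):
    """Rows are indexed by the `mode`-th index, columns by all remaining indices."""
    others = [range(n) for k, n in enumerate(shape) if k != mode]
    rows = []
    for i in range(shape[mode]):
        row = []
        for rest in product(*others):
            idx = rest[:mode] + (i,) + rest[mode:]
            row.append(A[idx])
        rows.append(row)
    return rows

def all_2x2_minors_vanish(M):
    for r1, r2 in combinations(range(len(M)), 2):
        for c1, c2 in combinations(range(len(M[0])), 2):
            if M[r1][c1] * M[r2][c2] - M[r1][c2] * M[r2][c1] != 0:
                return False
    return True

def in_segre(A, shape):
    """True if every flattening of A has rank at most one."""
    return all(all_2x2_minors_vanish(flattening(A, shape, mode))
               for mode in range(len(shape)))

# A rank-one tensor of format 2 x 3 x 2, and a sum of two rank-one tensors.
rank_one = outer([1, 2], [1, 0, -1], [3, 1])
e1, e2 = [1, 0], [0, 1]
t1, t2 = outer(e1, e1, e1), outer(e2, e2, e2)
two_terms = {idx: t1[idx] + t2[idx] for idx in t1}

print(in_segre(rank_one, (2, 3, 2)), in_segre(two_terms, (2, 2, 2)))
```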

The analogue of the Segre variety for symmetric tensors is the Veronese variety.


Definition 12.5 The Veronese variety is the variety of symmetric tensors of rank one:

\[ \mathcal{V} := \mathcal{V}_{n,d} := \{ v^{\otimes d} \mid v \in \mathbb{R}^n \}. \]

This is the intersection of the Segre variety $\mathcal{S}$ with a linear subspace. Hence, $\mathcal{V}$ is an
algebraic variety. Note that $\dim \mathcal{V} = n$.

Example 12.6 Let $v = (x, y) \in \mathbb{R}^2$. Then, $v^{\otimes 3}$ comprises all monomials of degree three:

\[ v^{\otimes 3} = \left[ \begin{array}{cc|cc} x^3 & x^2 y & x^2 y & x y^2 \\ x^2 y & x y^2 & x y^2 & y^3 \end{array} \right]. \]

Here, the Veronese variety $\mathcal{V}$ is a surface. Namely, it is the cone over the twisted cubic
curve. In general, the distinct entries of $v^{\otimes d}$ are all monomials of degree $d$ in the entries of
the vector $v$. The Veronese variety $\mathcal{V}$ is parametrized by these monomials. ⋄


The Segre variety induces a notion of rank of a tensor $A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$. Namely, we define

\[ \operatorname{rank}(A) := \min \{ r \ge 0 \mid \text{there exist } A_1, \ldots, A_r \in \mathcal{S} \text{ with } A = A_1 + \cdots + A_r \}. \]

For matrices ($d = 2$), this is the usual matrix rank. We denote tensors of rank at most $r$ by

\[ \mathcal{R}_r := \{ A \in \mathbb{R}^{n_1 \times \cdots \times n_d} \mid \operatorname{rank}(A) \le r \}. \]

A tensor of order three and rank $r$ can be visualized as follows:

[Figure: an order-three tensor written as a sum of $r$ rank-one tensors.]

In the case of matrices, the set $\mathcal{R}_r$ is a variety. It is the common zero set of all $(r+1) \times (r+1)$
minors of the matrix. For $d \ge 3$ and $r \ge 2$, however, $\mathcal{R}_r$ is no longer a variety. This is
implied by the following result due to de Silva and Lim [53].


Proposition 12.7 The set $\mathcal{R}_2$ of tensors of rank at most two is generally not closed in the
Euclidean topology.


Proof Fix $d = 3$ and $n_1 = n_2 = n_3 = 2$. Consider linearly independent pairs $\{x_1, x_2\}$,
$\{y_1, y_2\}$, and $\{z_1, z_2\}$ in $\mathbb{R}^2$. For any $\varepsilon > 0$, we define a rank-two tensor $A_\varepsilon \in \mathcal{R}_2$ as follows:

\[ A_\varepsilon := \varepsilon^{-1} (x_1 + \varepsilon x_2) \otimes (y_1 + \varepsilon y_2) \otimes (z_1 + \varepsilon z_2) - \varepsilon^{-1} x_1 \otimes y_1 \otimes z_1. \tag{12.4} \]

The limit tensor equals $A := \lim_{\varepsilon \to 0} A_\varepsilon = x_1 \otimes y_1 \otimes z_2 + x_1 \otimes y_2 \otimes z_1 + x_2 \otimes y_1 \otimes z_1$. We claim
that $A$ has rank three. For $h \in \mathbb{R}^2$, we apply multilinear multiplication by $X := (I_2, I_2, h^\top)$
to $A$. This gives the $2 \times 2$ matrix

\[ X.A = \langle h, z_2 \rangle \cdot x_1 \otimes y_1 + \langle h, z_1 \rangle \cdot x_1 \otimes y_2 + \langle h, z_1 \rangle \cdot x_2 \otimes y_1. \tag{12.5} \]

Choosing $h$ with $\langle h, z_1 \rangle \ne 0$ yields a matrix of rank two, and hence $A$ has rank at least
two. We suppose now that $A = u_1 \otimes v_1 \otimes w_1 + u_2 \otimes v_2 \otimes w_2$ has rank two. Among the
three pairs of vectors, at least two pairs are linearly independent. Suppose the pairs $(u_1, u_2)$
and $(v_1, v_2)$ are linearly independent. Then

\[ X.A = \langle h, w_1 \rangle \cdot u_1 \otimes v_1 + \langle h, w_2 \rangle \cdot u_2 \otimes v_2. \tag{12.6} \]

Let us now pick a non-zero $h$ such that $\langle h, z_1 \rangle = 0$. By (12.5), the matrix $X.A$ has rank one
and hence $\langle h, w_1 \rangle = 0$ or $\langle h, w_2 \rangle = 0$ by (12.6). We may assume $\langle h, w_1 \rangle = 0$. This implies
that $w_1$ is a multiple of $z_1$, since both are perpendicular to $h$. We distinguish two cases.
First, if the pair $(w_1, w_2)$ is linearly independent, there is another $h'$ with $\langle h', w_1 \rangle \ne 0$
but $\langle h', w_2 \rangle = 0$. Then, $\langle h', z_1 \rangle \ne 0$ and so $X.A$ has rank two by (12.5) and it has rank one
by (12.6); a contradiction. Second, if $w_1$, $w_2$ and $z_1$ are multiples of one another, then the
tensor $A$ is of the form $A = (\lambda_1 u_1 \otimes v_1 + \lambda_2 u_2 \otimes v_2) \otimes z_1$ (i.e., its "third row space" is
spanned by $z_1$). This is a contradiction to $A = x_1 \otimes y_1 \otimes z_2 + (x_1 \otimes y_2 + x_2 \otimes y_1) \otimes z_1$. In
both cases, we have derived a contradiction, so $A$ cannot have rank two, but it must have
rank three. Finally, the same proof extends to larger tensor formats. Namely, we can simply
embed $A$ into a larger tensor format while retaining the relevant rank properties. □
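
The convergence $A_\varepsilon \to A$ in (12.4) can be observed directly. In the sketch below we make the illustrative choice $x_1 = y_1 = z_1 = e_1$ and $x_2 = y_2 = z_2 = e_2$, which is an assumption of ours, and we compute $\|A_\varepsilon - A\|^2$ exactly with rational numbers. Each $A_\varepsilon$ is by construction a sum of two rank-one tensors, while the distance to the limit tends to zero.

```python
from fractions import Fraction
from itertools import product

def outer(*vectors):
    tensor = {}
    for idx in product(*[range(len(v)) for v in vectors]):
        entry = Fraction(1)
        for v, i in zip(vectors, idx):
            entry *= v[i]
        tensor[idx] = entry
    return tensor

def combine(c1, T1, c2, T2):
    """Linear combination c1 * T1 + c2 * T2 of two tensors of the same format."""
    return {idx: c1 * T1[idx] + c2 * T2[idx] for idx in T1}

e1 = [Fraction(1), Fraction(0)]
e2 = [Fraction(0), Fraction(1)]

# Limit tensor A from the proof, for x1 = y1 = z1 = e1 and x2 = y2 = z2 = e2.
A = {idx: Fraction(0) for idx in product(range(2), repeat=3)}
for T in (outer(e1, e1, e2), outer(e1, e2, e1), outer(e2, e1, e1)):
    A = combine(1, A, 1, T)

def A_eps(eps):
    """The tensor from (12.4): a difference of two rank-one tensors."""
    u = [e1[i] + eps * e2[i] for i in range(2)]          # e1 + eps * e2
    return combine(1 / eps, outer(u, u, u), -1 / eps, outer(e1, e1, e1))

def squared_distance(eps):
    B = A_eps(eps)
    return sum((B[idx] - A[idx]) ** 2 for idx in A)

epsilons = [Fraction(1, 10 ** k) for k in range(1, 5)]
distances = [squared_distance(eps) for eps in epsilons]
for eps, dist in zip(epsilons, distances):
    print(float(eps), float(dist))
```

For this particular choice of vectors, expanding (12.4) shows that the squared distance is $3\varepsilon^2 + \varepsilon^4$.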


A decomposition of a tensor $A$ of the form $A = A_1 + \cdots + A_r$ with $A_i \in \mathcal{S}$ is called a
rank-$r$ decomposition. In the signal processing literature, it is also called canonical polyadic
decomposition. One appealing property of higher-order tensors is identifiability. That is,
many tensor decompositions are actually unique (while matrices of rank at least two do not
have a unique decomposition). The following is [48, Theorem 1.1] and [149, Lemma 28].


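Theorem 12.8 Let $d \ge 3$ and $n_1 \ge \cdots \ge n_d$ with $\prod_{i=1}^d n_i \le 15000$, and set

\[ r_d := \left\lceil \frac{\dim \mathbb{R}^{n_1 \times \cdots \times n_d}}{\dim \mathcal{S}} \right\rceil = \left\lceil \frac{n_1 \cdots n_d}{1 + \sum_{i=1}^d (n_i - 1)} \right\rceil. \]

Suppose that $r < r_d$ and the tuple $(n_1, \ldots, n_d, r)$ is not among the following special cases:

$(n_1, \ldots, n_d)$                                       $r$
$(4, 4, 3)$                                                $5$
$(4, 4, 4)$                                                $6$
$(6, 6, 3)$                                                $8$
$(n, n, 2, 2)$                                             $2n - 1$
$(2, 2, 2, 2, 2)$                                          $5$
$n_1 > \prod_{i=2}^d n_i - \sum_{i=2}^d (n_i - 1)$         $r \ge \prod_{i=2}^d n_i - \sum_{i=2}^d (n_i - 1)$

Then, a general tensor in $\mathcal{R}_r$ has a unique rank-$r$ decomposition.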
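
The rank bound in Theorem 12.8 is the ceiling of $\dim \mathbb{R}^{n_1 \times \cdots \times n_d}$ divided by $\dim \mathcal{S}$, and it is easy to evaluate. The sketch below computes it for a few formats; the function name and the chosen formats are ours.

```python
from math import prod

def rank_bound(shape):
    """Ceiling of dim(tensor space) / dim(Segre variety) for the given format."""
    dim_space = prod(shape)
    dim_segre = 1 + sum(n - 1 for n in shape)
    return -(-dim_space // dim_segre)      # integer ceiling

formats = [(2, 2, 2), (3, 3, 3), (4, 4, 3), (4, 4, 4)]
bounds = {shape: rank_bound(shape) for shape in formats}
for shape, bound in bounds.items():
    print(shape, bound)
```

For instance, the format $(4,4,3)$ gives the bound 6, so the exceptional rank $r = 5$ in the table is indeed below the bound.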


Remark 12.9 There are other notions of tensor rank, and these give alternative decompositions.
Let $k = (k_1, \ldots, k_d)$ be a vector of integers with $1 \le k_i \le n_i$. Let $A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$
and $F_1, \ldots, F_d$ be the flattenings of $A$. Then, we say that $A$ has multilinear rank (at most) $k$
if $\operatorname{rank}(F_j) \le k_j$ for all $1 \le j \le d$. A block term decomposition of the tensor $A$ has the
form $A = A_1 + \cdots + A_r$, where each $A_i$ has low multilinear rank $k$. For order-three tensors,
a block term decomposition can be visualized as follows:

[Figure: an order-three tensor written as a sum of $r$ terms of low multilinear rank.]

The rank decomposition is the special case $k = (1, \ldots, 1)$. In statistics, this corresponds
to mixtures of independence models. Block term decompositions represent mixtures of
discrete probability distributions that are more complex. Identifiability for block term
decompositions is less studied than for rank decompositions. Some results, primarily for
the case $d = 3$, can be found in [118] and references therein. ⋄


12.2 Eigenvectors and Singular Vectors 


In this section, we study the Euclidean distance (ED) problem for the Segre variety and
the Veronese variety. We start with the observation that each of them is the image of a
polynomial map. The maps are

\[ \psi : \mathbb{R}^{n_1} \times \cdots \times \mathbb{R}^{n_d} \to \mathcal{S}, \ (v_1, \ldots, v_d) \mapsto v_1 \otimes \cdots \otimes v_d, \quad\text{and}\quad \nu : \mathbb{R}^n \to \mathcal{V}, \ v \mapsto v^{\otimes d}. \tag{12.7} \]

We first consider the ED problem for the Segre variety $\mathcal{S}$. Given any tensor $A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$,
we seek the rank-one tensor $B$ which is closest to $A$ in Euclidean norm. Thus, we must
solve the optimization problem

\[ \min_{B \in \mathcal{S}} \|A - B\| = \min_{v_i \in \mathbb{R}^{n_i}} \|A - \psi(v_1, \ldots, v_d)\|. \tag{12.8} \]

Consider the best rank-$r$ approximation for a tensor $A$ of order $d \ge 3$. This means computing
$\min_{B \in \mathcal{R}_r} \|A - B\|$. For $r = 1$, this is precisely (12.8), and we shall present the solution in
this section. For $r \ge 2$, this problem can be ill-posed (i.e., no minimizer exists). In fact,
Proposition 12.7 implies that this problem is ill-posed for all tensors $A$ in a full-dimensional
open subset of $\mathbb{R}^{n_1 \times \cdots \times n_d}$. This was first noted by de Silva and Lim [53]. This is different for
matrices, where the Eckart-Young Theorem (Theorem 2.9) provides an explicit algorithm
for computing the minimizer. Likewise, for $r = 1$, the image of the Segre map $\psi$ is closed,
and hence the problem (12.8) has a solution $B \in \mathcal{S}$. Our goal is to identify this $B$.

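The critical values of the distance function $\mathcal{S} \to \mathbb{R}$, $B \mapsto \|A - B\|$ are in one-to-one
correspondence to the critical values of

\[ \mathbb{R}^{n_1} \times \cdots \times \mathbb{R}^{n_d} \to \mathbb{R}, \quad (v_1, \ldots, v_d) \mapsto \|A - \psi(v_1, \ldots, v_d)\| \]

up to scaling $(v_1, \ldots, v_d) \mapsto (t_1 v_1, \ldots, t_d v_d)$ with $t_1 \cdots t_d = 1$. Our goal is therefore to
find the critical points of the function that maps the tuple $(v_1, \ldots, v_d)$ to

\[ \|A - \psi(v_1, \ldots, v_d)\|^2 = \|A\|^2 - 2 \langle A, \psi(v_1, \ldots, v_d) \rangle + \|\psi(v_1, \ldots, v_d)\|^2. \tag{12.9} \]

We compute the gradient of this function and set it equal to zero. This yields the critical
points $B$ we are interested in. By (12.3), the norm of $\psi$ can be written as

\[ \|\psi(v_1, \ldots, v_d)\|^2 = \|v_1\|^2 \cdots \|v_d\|^2. \]

The gradient of (12.9) with respect to $v_k$ therefore equals

\[ \frac{\partial}{\partial v_k} \Bigl( -2 \langle A, \psi(v_1, \ldots, v_d) \rangle + \|v_1\|^2 \cdots \|v_d\|^2 \Bigr)
= -2 \begin{bmatrix} \langle A, v_1 \otimes \cdots \otimes v_{k-1} \otimes e_1 \otimes v_{k+1} \otimes \cdots \otimes v_d \rangle \\ \vdots \\ \langle A, v_1 \otimes \cdots \otimes v_{k-1} \otimes e_{n_k} \otimes v_{k+1} \otimes \cdots \otimes v_d \rangle \end{bmatrix}
+ \Bigl( 2 \prod_{j \ne k} \|v_j\|^2 \Bigr) v_k. \tag{12.10} \]

Here $e_1, \ldots, e_{n_k}$ is the standard basis of $\mathbb{R}^{n_k}$. Abbreviating $\sigma_k := \prod_{j \ne k} \|v_j\|^2$, we write
the critical equations $\frac{\partial}{\partial v_k} \|A - \psi(v_1, \ldots, v_d)\|^2 = 0$ from (12.10) in compact form:

\[ A \bullet \bigotimes_{j \ne k} v_j = \sigma_k \cdot v_k \quad \text{for } k = 1, \ldots, d. \tag{12.11} \]

Here, the left-hand side denotes the vector in (12.10) obtained by contracting $A$ with all vectors $v_j$ for $j \ne k$.
For $d = 3$, we can visualize this as follows:

[Figure: the contraction of an order-three tensor $A$ with two of the vectors is a multiple of the third vector.] (12.12)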
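
To see the critical equations (12.11) at work, the following sketch runs a simple alternating iteration, often called the higher-order power method, on a small tensor and then checks (12.11) numerically. This iteration is not the method of the text; it is only a convenient way to reach one critical point. The tensor, the starting vectors, and the function names are hypothetical choices of ours.

```python
from itertools import product
from math import sqrt

def contract_all_but(A, vectors, k):
    """Contract the tensor A with every vector except the k-th one."""
    out = [0.0] * len(vectors[k])
    for idx, entry in A.items():
        coeff = entry
        for j, v in enumerate(vectors):
            if j != k:
                coeff *= v[idx[j]]
        out[idx[k]] += coeff
    return out

def normalize(v):
    norm = sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

# Hypothetical 2 x 2 x 2 tensor: 3 * e1(x)e1(x)e1 + 2 * e2(x)e2(x)e2.
A = {idx: 0.0 for idx in product(range(2), repeat=3)}
A[(0, 0, 0)] = 3.0
A[(1, 1, 1)] = 2.0

# Alternating updates v_k <- normalized contraction, from an arbitrary start.
vs = [normalize([1.0, 0.3]) for _ in range(3)]
for _ in range(50):
    for k in range(3):
        vs[k] = normalize(contract_all_but(A, vs, k))

# Rescale the first factor so that sigma_k = prod_{j != k} ||v_j||^2 holds.
scale = sum(x * y for x, y in zip(contract_all_but(A, vs, 0), vs[0]))
vs[0] = [scale * x for x in vs[0]]

def residual(k):
    """Largest entry of the difference of the two sides of (12.11)."""
    sigma_k = 1.0
    for j, v in enumerate(vs):
        if j != k:
            sigma_k *= sum(x * x for x in v)
    lhs = contract_all_but(A, vs, k)
    return max(abs(l - sigma_k * x) for l, x in zip(lhs, vs[k]))

residuals = [residual(k) for k in range(3)]
print(vs, residuals)
```

For this starting point, the iteration approaches the critical rank-one tensor $3\, e_1 \otimes e_1 \otimes e_1$. Other starting points may lead to other critical points, and the iteration need not find the global minimizer in general.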


Definition 12.10 Let $A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$. We say that $(v_1, \ldots, v_d) \in \mathbb{C}^{n_1} \times \cdots \times \mathbb{C}^{n_d}$, $v_i \ne 0$,
is a singular vector tuple for $A$ if the equation (12.11) is valid for some $\sigma_1, \ldots, \sigma_d \in \mathbb{C}$. If
this holds, then $(t_1 v_1, \ldots, t_d v_d)$ satisfies the same condition, for all $t_1, \ldots, t_d \in \mathbb{C} \setminus \{0\}$.
Thus, singular vector tuples are considered only up to scaling.


For $d = 2$, this coincides with the classic definition for rectangular matrices. The singular
vector pair consists of the left singular vector and the right singular vector. The singular
value decomposition implies that a general matrix $A \in \mathbb{R}^{n_1 \times n_2}$ has precisely $\min\{n_1, n_2\}$
real singular vector pairs. For higher-order tensors, not all singular vectors need to be real.


Proposition 12.11 The critical points (over $\mathbb{C}$) of the ED problem for the Segre variety $\mathcal{S}$
are the tuples of singular vectors. Hence, the Euclidean distance degree of $\mathcal{S}$ equals the
number of singular vector tuples.


Proof The computation in (12.10) shows that every critical point of the optimization
problem (12.8) is a singular vector tuple of the tensor $A$. Conversely, consider a solution
of (12.11). Multiplying (12.11) from the left with $v_k^\top$ yields that $\sigma_k \cdot \|v_k\|^2 = A \bullet \bigotimes_{j=1}^d v_j$
is independent of $k$. Thus, $t := \sigma_k \cdot \|v_k\|^2 / \prod_{j=1}^d \|v_j\|^2$ is independent of $k$. Now, rescaling
$(v_1, \ldots, v_d)$ by $(t_1, \ldots, t_d)$ such that $t_1 \cdot t_2 \cdots t_d = t$ yields a new solution of (12.11) that
satisfies $\sigma_k = \prod_{j \ne k} \|v_j\|^2$ for all $k$. Hence, the critical points are in bijection with the
singular vector tuples, assuming that these are viewed as points in $\mathbb{P}^{n_1 - 1} \times \cdots \times \mathbb{P}^{n_d - 1}$. □


Example 12.12 ($d = 3$) Fix an $n_1 \times n_2 \times n_3$ tensor $A = (a_{ijk})$. Our unknowns are the vectors
$v_1 = (x_1, \ldots, x_{n_1}) \in \mathbb{P}^{n_1 - 1}$, $v_2 = (y_1, \ldots, y_{n_2}) \in \mathbb{P}^{n_2 - 1}$ and $v_3 = (z_1, \ldots, z_{n_3}) \in \mathbb{P}^{n_3 - 1}$.
The singular vector equations (12.11) are

\[ \sum_{j=1}^{n_2} \sum_{k=1}^{n_3} a_{ijk} \, y_j z_k = \sigma_1 x_i \quad \text{for } i = 1, 2, \ldots, n_1, \]
\[ \sum_{i=1}^{n_1} \sum_{k=1}^{n_3} a_{ijk} \, x_i z_k = \sigma_2 y_j \quad \text{for } j = 1, 2, \ldots, n_2, \]
\[ \sum_{i=1}^{n_1} \sum_{j=1}^{n_2} a_{ijk} \, x_i y_j = \sigma_3 z_k \quad \text{for } k = 1, 2, \ldots, n_3. \]

To work modulo scaling, we set $x_1 = y_1 = z_1 = 1$. Then this is a square system of
polynomial equations, and it has finitely many solutions for generic tensors $A = (a_{ijk})$.
For instance, the number of solutions is 6 when $n_1 = n_2 = n_3 = 2$ and it is 37 when
$n_1 = n_2 = n_3 = 3$. This will be explained by Theorem 12.13.

To obtain the critical points of our ED problem (12.8), we retain $x_1, y_1, z_1$ as unknowns,
and we add the constraints $\sigma_k = \prod_{j \ne k} \|v_j\|^2$ for $k = 1, 2, 3$. Explicitly, for $n_1 = n_2 = n_3 = 2$,
these constraints are the polynomial equations

\[ (x_1^2 + x_2^2)(y_1^2 + y_2^2) = \sigma_3, \quad (x_1^2 + x_2^2)(z_1^2 + z_2^2) = \sigma_2, \quad\text{and}\quad (y_1^2 + y_2^2)(z_1^2 + z_2^2) = \sigma_1. \]

For each singular vector triple, we obtain a curve of solutions that maps to the same critical
tensor $B$ of rank one, up to scaling. For instance, for $n_1 = n_2 = n_3 = 2$, there are six
such curves. Each of them has degree 12. In other words, the formulation on the right
of (12.8) requires some care because the rank-one factorization (12.1) is not unique. By
using singular vector tuples up to scaling, we are on the safe side. ⋄


The following theorem of Friedland and Ottaviani [68] gives a formula, in terms of
$n_1, n_2, \ldots, n_d$, for the number of singular vector tuples, and so for the ED degree of $\mathcal{S}$.


Theorem 12.13 Let $A \in \mathbb{R}^{n_1 \times \cdots \times n_d}$ be a general tensor. The number of singular vector
tuples of $A$, counted up to scaling $(v_1, \ldots, v_d) \mapsto (t_1 v_1, \ldots, t_d v_d)$, is the coefficient
of $x_1^{n_1 - 1} \cdots x_d^{n_d - 1}$ in the polynomial

\[ \prod_{i=1}^{d} \frac{\hat{x}_i^{\,n_i} - x_i^{n_i}}{\hat{x}_i - x_i}, \quad \text{where } \hat{x}_i := \sum_{j \ne i} x_j. \]


Example 12.14 The formula in Theorem 12.13 for binary tensors $A \in \mathbb{R}^{2 \times \cdots \times 2}$ gives

\[ \prod_{i=1}^{d} \frac{\hat{x}_i^{\,2} - x_i^2}{\hat{x}_i - x_i} = \prod_{i=1}^{d} (\hat{x}_i + x_i) = (x_1 + x_2 + \cdots + x_d)^d. \]

The coefficient of $x_1 x_2 \cdots x_d$ in this polynomial equals $d!$. In particular, the ED degree of
the Segre variety $\mathcal{S}$ in $\mathbb{R}^{2 \times 2 \times 2}$ is found to be $3! = 6$. ⋄
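
The coefficient in Theorem 12.13 can be extracted by expanding the product with elementary polynomial arithmetic. The sketch below stores a polynomial as a dictionary from exponent tuples to coefficients and uses the identity $(\hat{x}^n - x^n)/(\hat{x} - x) = \sum_{k=0}^{n-1} \hat{x}^k x^{n-1-k}$. All function names are our own, and the implementation favors clarity over speed.

```python
from itertools import product

def poly_mul(p, q):
    """Multiply two polynomials stored as {exponent tuple: coefficient}."""
    result = {}
    for (ea, ca), (eb, cb) in product(p.items(), q.items()):
        e = tuple(x + y for x, y in zip(ea, eb))
        result[e] = result.get(e, 0) + ca * cb
    return result

def poly_add(p, q):
    result = dict(p)
    for e, c in q.items():
        result[e] = result.get(e, 0) + c
    return result

def poly_pow(p, k, d):
    result = {(0,) * d: 1}
    for _ in range(k):
        result = poly_mul(result, p)
    return result

def singular_tuple_count(shape):
    """Coefficient of x_1^(n_1 - 1) ... x_d^(n_d - 1) in the product of Theorem 12.13."""
    d = len(shape)

    def unit(i):
        return tuple(int(j == i) for j in range(d))

    total = {(0,) * d: 1}
    for i, n in enumerate(shape):
        x_i = {unit(i): 1}
        x_hat = {unit(j): 1 for j in range(d) if j != i}
        # (x_hat^n - x_i^n) / (x_hat - x_i) = sum_{k=0}^{n-1} x_hat^k * x_i^(n-1-k)
        factor = {}
        for k in range(n):
            term = poly_mul(poly_pow(x_hat, k, d), poly_pow(x_i, n - 1 - k, d))
            factor = poly_add(factor, term)
        total = poly_mul(total, factor)
    target = tuple(n - 1 for n in shape)
    return total.get(target, 0)

print(singular_tuple_count((2, 2, 2)), singular_tuple_count((3, 3, 3)))
```

For matrices, the same function returns $\min\{n_1, n_2\}$ in small cases, for example 2 for the format $(2, 3)$, in agreement with the singular value decomposition.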


We turn from general tensors to symmetric tensors, and we consider the ED problem for
the Veronese variety $\mathcal{V}$ in Definition 12.5. Now $A$ is a symmetric tensor, all $v_i$ are equal to
the same vector $v$ in $\mathbb{R}^n$, and the gradient in (12.10) does not depend on $k$. Setting $\lambda := \sigma_k$,
the critical equations for the ED problem on $\mathcal{V}$ are

\[ A \bullet v^{\otimes (d-1)} = \lambda v. \tag{12.13} \]

Similarly to (12.12), we visualize this equation for symmetric tensors of order $d = 3$:

[Figure: the contraction of a symmetric order-three tensor $A$ with two copies of $v$ is a multiple of $v$.]

Definition 12.15 Let $A \in S^d(\mathbb{R}^n)$. We call $v \in \mathbb{C}^n \setminus \{0\}$ an eigenvector of $A$ if (12.13) holds
for some $\lambda \in \mathbb{C}$. The pair $(v, \lambda)$ is called an eigenpair of the symmetric tensor $A$.


Eigenpairs have another interesting interpretation, next to being critical points for the
Euclidean distance problem on the Veronese variety $\mathcal{V}$. Recall that every symmetric tensor
$A \in S^d(\mathbb{R}^n)$ corresponds uniquely to a homogeneous polynomial $F_A(x) = \langle A, x^{\otimes d} \rangle$ of
degree $d$ in $n$ variables. Under this interpretation, the vector $A \bullet x^{\otimes (d-1)}$ on the left in
(12.13) is, up to the constant factor $d$, the gradient vector $\nabla F_A(x) = (\partial F_A / \partial x_1, \ldots, \partial F_A / \partial x_n)^\top$. Therefore, eigenpairs
correspond to fixed points of the gradient map $\mathbb{P}^{n-1} \dashrightarrow \mathbb{P}^{n-1}$, $x \mapsto \nabla F_A(x)$.


Example 12.16 ($n = 3$) Symmetric tensors of order $d$ and format $3 \times 3 \times \cdots \times 3$ correspond
to ternary forms $F(x_1, x_2, x_3)$ of degree $d$. The eigenvectors of such a tensor are solutions of

\[ \partial F / \partial x_1 = \lambda x_1, \quad \partial F / \partial x_2 = \lambda x_2, \quad \partial F / \partial x_3 = \lambda x_3. \]

Eliminating $\lambda$ is equivalent to the statement that the following matrix has rank one:

\[ \begin{pmatrix} x_1 & x_2 & x_3 \\ \partial F / \partial x_1 & \partial F / \partial x_2 & \partial F / \partial x_3 \end{pmatrix}. \]

This is a dependent system of three homogeneous equations of degree $d$ in three variables.
The number of solutions $(x_1 : x_2 : x_3)$ in $\mathbb{P}^2$ equals $d^2 - d + 1$. For $d = 2$, these are just
the three eigenvectors of a symmetric $3 \times 3$ matrix. For $d = 3$, one interesting example
is $F = x_1^3 + x_2^3 + x_3^3$. The corresponding $3 \times 3 \times 3$ tensor has 7 eigenvectors, namely the
elements of $\{0, 1\}^3 \setminus \{(0,0,0)\}$. How about $F = x_1^d + x_2^d + x_3^d$ for $d \ge 4$? ⋄
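
The claim about the cubic $F = x_1^3 + x_2^3 + x_3^3$ is easy to confirm with the rank-one criterion from Example 12.16. The sketch below checks that each nonzero vector with entries in $\{0, 1\}$ makes all $2 \times 2$ minors of the matrix vanish. It only verifies that these seven points are eigenvectors; that there are no others follows from the count $d^2 - d + 1 = 7$. The function names are ours.

```python
from itertools import product

def gradient_fermat(x, d):
    """Gradient of F = x_1^d + x_2^d + x_3^d at the point x."""
    return [d * xi ** (d - 1) for xi in x]

def is_eigenvector(x, d):
    """Check that the 2 x 3 matrix with rows x and grad F(x) has rank one."""
    g = gradient_fermat(x, d)
    return all(x[i] * g[j] - x[j] * g[i] == 0
               for i in range(3) for j in range(i + 1, 3))

d = 3
candidates = [x for x in product((0, 1), repeat=3) if any(x)]
eigenvectors = [x for x in candidates if is_eigenvector(x, d)]
print(len(eigenvectors), d * d - d + 1)
```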


We now remove the assumption that $A$ is symmetric. Definition 12.15 makes sense for
any $A \in (\mathbb{R}^n)^{\otimes d}$. We lose the interpretations above, but we gain a general definition. The
number of eigenpairs of a general tensor was found by Cartwright and Sturmfels in [39].


Theorem 12.17 Let $A \in (\mathbb{R}^n)^{\otimes d}$ be a general tensor. The number of complex eigenvectors
of $A$, up to scaling $v \mapsto tv$, is the sum

\[ \sum_{i=0}^{n-1} (d-1)^i. \]


Proof For $d = 2$, the sum gives $n$, which is the number of eigenvectors of a general matrix
$A \in \mathbb{R}^{n \times n}$. Hence, we assume $d > 2$ from now on. We use the same proof strategy as for
Corollary 3.15. Consider the symmetric tensor $A$ with

\[ F_A(x) = x_1^d + x_2^d + \cdots + x_n^d. \]

The equations for its eigenpairs are

\[ x_1^{d-1} - \xi^{d-2} x_1 = x_2^{d-1} - \xi^{d-2} x_2 = \cdots = x_n^{d-1} - \xi^{d-2} x_n = 0, \]

where $\lambda = \xi^{d-2}$ is the eigenvalue. We count that this system of homogeneous polynomial
equations has precisely $(d-1)^n$ regular solutions in $\mathbb{P}^n$. By Bézout's theorem, a general
system of $n$ polynomials with degrees $(d-1, \ldots, d-1)$ also has $(d-1)^n$ regular zeros.
The Parameter Continuation Theorem 3.18 then implies that the analogous equations for
a general tensor $A \in (\mathbb{R}^n)^{\otimes d}$ also have $(d-1)^n$ regular zeros. We remove the extraneous
solution in $\mathbb{P}^n$ that is given by $\xi = 1$ and $x = 0$. The remaining $(d-1)^n - 1$ solutions come
in clusters of $d - 2$, where $\xi$ runs over the $(d-2)$-th roots of unity. After taking the image
of the projection $(\xi, x) \mapsto x$, the number of distinct points $x$ in $\mathbb{P}^{n-1}$ is the quotient

\[ \frac{(d-1)^n - 1}{d - 2} = \sum_{i=0}^{n-1} (d-1)^i. \]

We conclude that a general tensor of format $n \times \cdots \times n$ has $\sum_{i=0}^{n-1} (d-1)^i$ eigenvectors. □
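
The two expressions for the count in Theorem 12.17 can be compared numerically. The sketch below evaluates the sum and the quotient from the proof for small $n$ and $d$, and it also recovers the count $d^2 - d + 1$ of Example 12.16 for $n = 3$. The function names are ours.

```python
def eigenvector_count(n, d):
    """Sum of (d-1)^i for i = 0, ..., n-1, as in Theorem 12.17."""
    return sum((d - 1) ** i for i in range(n))

def eigenvector_count_quotient(n, d):
    """Closed form ((d-1)^n - 1) / (d-2) from the proof, valid for d > 2."""
    return ((d - 1) ** n - 1) // (d - 2)

for n in range(2, 5):
    for d in range(3, 6):
        print(n, d, eigenvector_count(n, d), eigenvector_count_quotient(n, d))
```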


For matrices, every $n$-tuple of linearly independent vectors can be eigenvectors of a
matrix. For tensors, this is not so. Abo, Sturmfels, and Seigal proved in [1] that a set of $d$
points $v_1, \ldots, v_d$ in $\mathbb{C}^2$ is the eigenconfiguration of a symmetric tensor in $S^d(\mathbb{C}^2)$ if and
only if $d$ is odd, or $d = 2k$ is even and the differential operator $\bigl( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} \bigr)^k$ annihilates
the binary form $f_v(x, y) := \prod_{i=1}^{d} (b_i x - a_i y)$, where $v_i = (a_i, b_i)$, for all $i = 1, \ldots, d$. The
extension of this result to any number of variables was studied by Beorchia, Galuppi, and
Venturello in [17]. This study of eigenschemes continues to be an area of active research.

In this section, we showed that the ED problem for Segre varieties and Veronese 
varieties leads directly to singular vectors and eigenvectors of tensors. By the ED duality in 
Theorem 2.23, it is equivalent to study the ED problem for the dual hypersurfaces, which 
are the hyperdeterminant and the discriminant, respectively. This, in turn, takes us to the 
geometry of condition numbers, as discussed in Chapter 9. A natural generalization arises
from the Segre-Veronese varieties, whose polar degrees we saw in Theorem 5.18.


12.3 Volumes of Rank-One Varieties 


We now shift gears, and we examine the metric geometry of the Segre variety and the
Veronese variety. Both are now taken to be complex projective varieties. To this end, we
pass to complex projective space $\mathbb{P}^N$, where $N = n_1 \cdots n_d - 1$, or $N = n^d - 1$ if the $n_i$ are
all equal. Our two complex projective varieties are

\[ \mathcal{S}_{\mathbb{P}} := \{ v_1 \otimes \cdots \otimes v_d \in \mathbb{P}^N \mid v_i \in \mathbb{P}^{n_i - 1}, \ 1 \le i \le d \} \quad\text{and}\quad \mathcal{V}_{\mathbb{P}} := \{ v^{\otimes d} \in \mathbb{P}^N \mid v \in \mathbb{P}^{n-1} \}. \tag{12.14} \]

Thus, $\mathcal{S}_{\mathbb{P}}$ consists of rank-one tensors $v_1 \otimes \cdots \otimes v_d$, where $v_i \in \mathbb{C}^{n_i}$, taken up to scaling by
$\mathbb{C}^*$. Similarly, $\mathcal{V}_{\mathbb{P}}$ consists of all vectors $v^{\otimes d}$ for $v \in \mathbb{C}^n$, again taken up to scaling by $\mathbb{C}^*$.
Our aim is to compute the volumes of $\mathcal{S}_{\mathbb{P}}$ and $\mathcal{V}_{\mathbb{P}}$. The first crucial step is to explain
what this means. In Section 6.3 and Chapter 14, we discuss volumes of real semialgebraic
sets. But here, the situation is different. In this section, our objects are lower-dimensional
submanifolds in the complex projective space $\mathbb{P}^N$. How does one measure volumes in $\mathbb{P}^N$?
How is the volume of a submanifold defined and computed?
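We begin with the Hermitian inner product on the tensor space $\mathbb{C}^{n_1 \times \cdots \times n_d}$. This is

\[ \langle A, B \rangle_{\mathbb{C}} := \sum_{i_1=1}^{n_1} \cdots \sum_{i_d=1}^{n_d} a_{i_1,\ldots,i_d} \, \overline{b_{i_1,\ldots,i_d}}. \]

We view $\mathbb{C}^{n_1 \times \cdots \times n_d}$ as a real Euclidean space of dimension $2 n_1 \cdots n_d$, where the Euclidean
inner product is $(A, B) \mapsto \operatorname{Re}(\langle A, B \rangle_{\mathbb{C}})$. The same definitions make sense in $\mathbb{C}^n$, now
identified with $\mathbb{R}^{2n}$, and we define the unit sphere in the Euclidean space $\mathbb{C}^n$ as

\[ \mathbb{S}^{2n-1} := \{ a \in \mathbb{C}^n \mid \langle a, a \rangle_{\mathbb{C}} = 1 \}. \]

This sphere is a real manifold of real dimension $\dim_{\mathbb{R}} \mathbb{S}^{2n-1} = 2n - 1$. The projection
$\pi : \mathbb{S}^{2n-1} \to \mathbb{P}^{n-1}$ sends a vector $a \in \mathbb{S}^{2n-1}$ to its class in complex projective space. The
fiber of the quotient map $\pi$ has real dimension one: it is a circle. The tangent space of the
sphere $\mathbb{S}^{2n-1}$ at a point $a$ is the real vector space

\[ T_a \mathbb{S}^{2n-1} = \{ t \in \mathbb{C}^n \mid \operatorname{Re}(\langle a, t \rangle_{\mathbb{C}}) = 0 \} \cong \mathbb{R}^{2n-1}. \]


Lemma 12.18 Let $t \in \mathbb{C}^n$. We have $\operatorname{Re}(\langle a, t \rangle_{\mathbb{C}}) = 0$ if and only if $t$ is a point in the real
hyperplane spanned by the real line through $\sqrt{-1} \cdot a$ and the codimension-two linear space
defined by $\langle a, t \rangle_{\mathbb{C}} = 0$.


Proof The space of all $t \in \mathbb{C}^n$ such that $\operatorname{Re}(\langle a, t \rangle_{\mathbb{C}}) = 0$ is a linear space of real codimension
one in $\mathbb{C}^n \cong \mathbb{R}^{2n}$. If $t$ is a real multiple of $\sqrt{-1} \cdot a$ or $\langle a, t \rangle_{\mathbb{C}} = 0$, then we have $\operatorname{Re}(\langle a, t \rangle_{\mathbb{C}}) = 0$.
The other inclusion follows from comparing dimensions. □


For $x = \pi(a) \in \mathbb{P}^{n-1}$, the fiber under $\pi$ is $\pi^{-1}(x) = \{ \exp(\sqrt{-1}\, \varphi) \, a \mid \varphi \in \mathbb{R} \}$. Its tangent
space at $a$ is spanned by $\bigl( \frac{\partial}{\partial \varphi} \exp(\sqrt{-1}\, \varphi) \cdot a \bigr) \big|_{\varphi = 0} = \sqrt{-1} \cdot a$. Consequently, Lemma 12.18
implies that the tangent space of the sphere $T_a \mathbb{S}^{2n-1}$ has an orthogonal decomposition
into the tangent space of the fiber $T_a \pi^{-1}(x) = \mathbb{R} \cdot \sqrt{-1}\, a$ and the linear space defined
by $\langle a, t \rangle_{\mathbb{C}} = 0$. Therefore, we can view the following as the tangent space of projective
space at the point $x = \pi(a)$, where $a \in \mathbb{S}^{2n-1}$:

\[ T_x \mathbb{P}^{n-1} = \{ t \in \mathbb{C}^n \mid \langle a, t \rangle_{\mathbb{C}} = 0 \} \cong \mathbb{R}^{2n-2}. \tag{12.15} \]

In conclusion, the complex projective space $\mathbb{P}^{n-1}$ is a Riemannian manifold with respect to
the Euclidean structure $\operatorname{Re}(\langle a, b \rangle_{\mathbb{C}})$. This induces a notion of volume for subsets of $\mathbb{P}^{n-1}$.
Our metric structures on $\mathbb{S}^{2n-1}$ and $\mathbb{P}^{n-1}$ have the property that the projection $\pi$ is a
Riemannian submersion. This implies that the $m$-dimensional real volume of a measurable
subset $U \subset \mathbb{P}^{n-1}$ is

\[ \operatorname{vol}_m(U) = \frac{1}{2\pi} \operatorname{vol}_{m+1}(\pi^{-1}(U)), \]

since the preimage $\pi^{-1}(x)$ for $x \in \mathbb{P}^{n-1}$ is a circle of circumference $2\pi$.

The volume of projective space itself is

\[ \operatorname{vol}_{2(n-1)}(\mathbb{P}^{n-1}) = \frac{1}{2\pi} \operatorname{vol}_{2n-1}(\mathbb{S}^{2n-1}) = \frac{\pi^{n-1}}{(n-1)!}. \tag{12.16} \]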
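
Formula (12.16) can be checked against the classical formula for the volume of the unit sphere, $\operatorname{vol}(\mathbb{S}^k) = 2\pi^{(k+1)/2} / \Gamma((k+1)/2)$, which we take as known and which is not stated in the text. The function names in the sketch are ours.

```python
from math import pi, factorial, gamma, isclose

def sphere_volume(k):
    """Volume of the unit sphere S^k in R^(k+1): 2 * pi^((k+1)/2) / Gamma((k+1)/2)."""
    return 2 * pi ** ((k + 1) / 2) / gamma((k + 1) / 2)

def projective_volume(n):
    """Volume of complex projective space P^(n-1), cf. (12.16)."""
    return pi ** (n - 1) / factorial(n - 1)

for n in range(1, 6):
    # The fiber of the projection is a circle of length 2 * pi.
    print(n, projective_volume(n), sphere_volume(2 * n - 1) / (2 * pi))
```

For $n = 2$ this gives $\operatorname{vol}(\mathbb{P}^1) = \pi$, the area of a sphere of radius $1/2$.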


In the following, we sometimes omit the subscript from vol when the dimension is clear
from the context. For the next proposition, we abbreviate $m := \dim_{\mathbb{R}} \mathcal{S}_{\mathbb{P}}$.


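Proposition 12.19 The $m$-dimensional volume of the Segre variety $\mathcal{S}_{\mathbb{P}}$ in (12.14) is

\[ \operatorname{vol}(\mathcal{S}_{\mathbb{P}}) = \operatorname{vol}(\mathbb{P}^{n_1 - 1}) \cdots \operatorname{vol}(\mathbb{P}^{n_d - 1}). \]

Proof The Segre map from (12.7) for complex projective space is the smooth embedding

\[ \psi_{\mathbb{P}} : \mathbb{P}^{n_1 - 1} \times \cdots \times \mathbb{P}^{n_d - 1} \to \mathcal{S}_{\mathbb{P}}. \]

Let $(x_1, \ldots, x_d) \in \mathbb{P}^{n_1 - 1} \times \cdots \times \mathbb{P}^{n_d - 1}$ and $a_i \in \mathbb{S}^{2n_i - 1}$ with $\pi(a_i) = x_i$ for $i = 1, \ldots, d$.
We also fix elements $t_i \in \mathbb{C}^{n_i}$ with $\langle a_i, t_i \rangle_{\mathbb{C}} = 0$ for all $i$. These specify vectors in the
various tangent spaces (12.15). The derivative of the Segre map $\psi_{\mathbb{P}}$ at our point takes
$(t_1, \ldots, t_d) \in T_{(x_1, \ldots, x_d)}(\mathbb{P}^{n_1 - 1} \times \cdots \times \mathbb{P}^{n_d - 1})$ to

\[ \theta := t_1 \otimes a_2 \otimes \cdots \otimes a_d + a_1 \otimes t_2 \otimes \cdots \otimes a_d + \cdots + a_1 \otimes a_2 \otimes \cdots \otimes t_d. \]

It follows from (12.3) that the terms in this sum are pairwise orthogonal. Therefore,

\[ \|\theta\|^2 = \|t_1\|^2 + \|t_2\|^2 + \cdots + \|t_d\|^2. \]

Hence, the derivative of $\psi_{\mathbb{P}}$ preserves norms, and $\psi_{\mathbb{P}}$ is a volume-preserving embedding. □

Proposition 12.20 The $2(n-1)$-dimensional volume of the Veronese variety in (12.14) is

\[ \operatorname{vol}(\mathcal{V}_{\mathbb{P}}) = d^{\,n-1} \cdot \operatorname{vol}(\mathbb{P}^{n-1}). \]


Proof The proof is similar to that of Proposition 12.19. We denote the projective Veronese
map by $\nu_{\mathbb{P}} : \mathbb{P}^{n-1} \to \mathcal{V}_{\mathbb{P}}$. Just like the Segre map, $\nu_{\mathbb{P}}$ is a smooth embedding. Let $x \in \mathbb{P}^{n-1}$
be a point in projective space and fix a representative $a \in \mathbb{S}^{2n-1}$ for $x$; i.e., $\pi(a) = x$. Let
$t \in \mathbb{C}^n$ with $\langle a, t \rangle_{\mathbb{C}} = 0$. The derivative of $\nu_{\mathbb{P}}$ at $a$ maps $t \in T_x \mathbb{P}^{n-1}$ to

\[ \theta := t \otimes a \otimes \cdots \otimes a + a \otimes t \otimes \cdots \otimes a + \cdots + a \otimes a \otimes \cdots \otimes t. \]

It follows from (12.3) that the terms in this sum are pairwise orthogonal, so $\|\theta\|^2 = d \|t\|^2$.
This shows that the derivative of $\nu_{\mathbb{P}}$ scales norms by a factor of $\sqrt{d}$. We get the formula

\[ \operatorname{vol}(\mathcal{V}_{\mathbb{P}}) = (\sqrt{d})^{2(n-1)} \cdot \operatorname{vol}(\mathbb{P}^{n-1}) = d^{\,n-1} \cdot \operatorname{vol}(\mathbb{P}^{n-1}), \]

which relates the volume of the Veronese variety to that of the underlying projective space. □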

The volume of a complex projective variety is closely related to its degree. This is
the content of Howard's Kinematic Formula [94], which concerns the average volume of
intersections of submanifolds in homogeneous spaces. For complex projective space, we
apply [94, Theorem 3.8 & Corollary 3.9] as follows. Let $M \subset \mathbb{P}^N$ be a smooth manifold of
complex dimension $m$. Here, the Kinematic Formula states

\[ \mathbb{E}_U \, \# \bigl( M \cap U \cdot (\mathbb{P}^{N-m} \times \{0\}^m) \bigr) = \frac{\operatorname{vol}_{2m}(M)}{\operatorname{vol}_{2m}(\mathbb{P}^m)}, \]

where the expectation refers to the probability Haar measure on the unitary group $U(N+1)$.

If $X \subset \mathbb{P}^N$ is a smooth variety of complex dimension $m$, then the number of intersection
points $\# ( X \cap U \cdot (\mathbb{P}^{N-m} \times \{0\}^m) )$ equals the degree of $X$ for almost all $U \in U(N+1)$.
Thus, we take the expected value of a constant function:

\[ \deg(X) = \frac{\operatorname{vol}_{2m}(X)}{\operatorname{vol}_{2m}(\mathbb{P}^m)}, \quad \text{where } m = \dim_{\mathbb{C}}(X). \tag{12.17} \]


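Using Propositions 12.19 and 12.20, our volume computation yields the following result:


Corollary 12.21 The degrees of the Segre variety and the Veronese variety are equal to

(a) $\deg(\mathcal{S}_{\mathbb{P}}) = \dfrac{(n_1 + \cdots + n_d - d)!}{(n_1 - 1)! \cdots (n_d - 1)!}$,

(b) $\deg(\mathcal{V}_{\mathbb{P}}) = d^{\,n-1}$.

Proof The second formula follows directly from Proposition 12.20 and (12.17). For the first
formula, we recall from (12.16) the volume of projective space: $\operatorname{vol}_{2(n-1)}(\mathbb{P}^{n-1}) = \frac{\pi^{n-1}}{(n-1)!}$.
We set $m := \dim_{\mathbb{C}} \mathcal{S}_{\mathbb{P}}$. Using Proposition 12.19 and (12.17) we then have

\[ \deg(\mathcal{S}_{\mathbb{P}}) = \frac{\operatorname{vol}(\mathbb{P}^{n_1 - 1}) \cdots \operatorname{vol}(\mathbb{P}^{n_d - 1})}{\operatorname{vol}(\mathbb{P}^m)}
= \frac{\pi^{(n_1 - 1) + \cdots + (n_d - 1)}}{\pi^m} \cdot \frac{m!}{(n_1 - 1)! \cdots (n_d - 1)!}
= \frac{(n_1 + \cdots + n_d - d)!}{(n_1 - 1)! \cdots (n_d - 1)!}, \]

where we have used that $m = n_1 + \cdots + n_d - d$. □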
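
The computation in this proof can be replayed numerically. The sketch below evaluates the degree of the Segre variety once by the factorial formula in (a) and once as the ratio of volumes from (12.17), using the volume formula (12.16). The function names and the sample formats are ours.

```python
from math import pi, factorial, prod, isclose

def projective_volume(n):
    """Volume of complex projective space P^(n-1), cf. (12.16)."""
    return pi ** (n - 1) / factorial(n - 1)

def segre_degree(shape):
    """Degree of the Segre variety by the factorial formula in Corollary 12.21 (a)."""
    m = sum(n - 1 for n in shape)
    return factorial(m) // prod(factorial(n - 1) for n in shape)

def segre_degree_from_volumes(shape):
    """Same degree as the volume ratio vol(Segre) / vol(P^m) from (12.17)."""
    m = sum(n - 1 for n in shape)
    return prod(projective_volume(n) for n in shape) / projective_volume(m + 1)

for shape in [(2, 2), (2, 2, 2), (3, 3), (2, 3, 4)]:
    print(shape, segre_degree(shape), segre_degree_from_volumes(shape))
```

For the format $(2, 2)$ this returns 2, the degree of the quadric surface $\mathbb{P}^1 \times \mathbb{P}^1$ in $\mathbb{P}^3$.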


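Remark 12.22 By (12.2), a linear equation on the Veronese variety $\mathcal{V}_{\mathbb{P}}$ is a homogeneous
polynomial equation of degree $d$ on projective space $\mathbb{P}^{n-1}$. Thus, $\deg(\mathcal{V}_{\mathbb{P}}) = d^{\,n-1}$ means
that $n - 1$ general homogeneous polynomials of degree $d$ in $n$ variables have $d^{\,n-1}$ common zeros in $\mathbb{P}^{n-1}$.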


Remark 12.23 Howard's Kinematic Formula [94] also provides the following result for real
projective space $\mathbb{P}^N_{\mathbb{R}}$. Let $M \subset \mathbb{P}^N_{\mathbb{R}}$ be a real submanifold of real dimension $m$. Then, the
Kinematic Formula implies that $\mathbb{E}_U \, \# ( M \cap U \cdot (\mathbb{P}^{N-m}_{\mathbb{R}} \times \{0\}^m) ) = \operatorname{vol}_m(M) / \operatorname{vol}_m(\mathbb{P}^m_{\mathbb{R}})$,
where the expectation refers to the probability Haar measure on the real orthogonal group
$O(N + 1)$. Thus, the volume of real projective varieties can be interpreted as an "average
degree". We can use the same proof strategies as above to show that the projective volume
of the real Segre variety $\mathcal{S}$ is equal to $\operatorname{vol}(\mathbb{P}^{n_1 - 1}_{\mathbb{R}}) \cdots \operatorname{vol}(\mathbb{P}^{n_d - 1}_{\mathbb{R}})$ and the projective volume
of the real Veronese variety $\mathcal{V}$ is $\sqrt{d}^{\,n-1} \cdot \operatorname{vol}(\mathbb{P}^{n-1}_{\mathbb{R}})$. The latter result was first derived by Edelman
and Kostlan in their seminal paper [66]. They use this to find the average number of real
zeros of a system of homogeneous polynomials of degree $d$ in $n$ variables.


Chapter 13
Computer Vision


The field of computer vision studies how computers can gain understanding from images 
and videos, similar to human cognitive abilities. One of the classical challenges is to 
reconstruct a 3D object from images taken by several unknown cameras. While the resulting 
questions in multiview geometry [80] have a long history in computer vision, recent years 
have seen a confluence with ideas and algorithms from algebraic geometry. This led to 
the development of a new subject area called Algebraic Vision [104]. In this chapter, we 
present an introduction to this area from the perspective of metric algebraic geometry. 


13.1 Multiview Varieties 


A pinhole camera consists of a point c, called camera center, and an image plane H in 
3-space. Taking an image of a 3D point x is modeled by the following process. One draws 
a line between x and the camera center c. The intersection point of that line with H is 
the image point. This is shown in Figure 13.1. The common algebraic model for such a 
camera is a surjective linear projection $C : \mathbb{P}^3 \dashrightarrow \mathbb{P}^2$, given by a full-rank $3 \times 4$ matrix $A$,
i.e., $C(x) = Ax$. Thus, we work in projective geometry, with homogeneous coordinates.
We take 3D points in projective space $\mathbb{P}^3$, and we identify the image plane $H$ with the
projective plane $\mathbb{P}^2$. For arbitrary $A$, such cameras are called projective or uncalibrated.
In many applications, one does not allow arbitrary matrices $A$ because one assumes
several internal camera parameters, such as focal length, to be known. Calibrated cameras
are those that can be obtained by rotating and translating a camera in standard position; see
Figure 13.1. The standard camera is the linear projection $\mathbb{P}^3 \dashrightarrow \mathbb{P}^2$,
$[x_1 : x_2 : x_3 : x_4] \mapsto [x_1 : x_2 : x_3]$, given by the $3 \times 4$ matrix $A = [\, I_3 \mid 0 \,]$. Thus, all calibrated cameras are
given by matrices $A = [\, R \mid t \,]$, where $R \in \mathrm{SO}(3)$ and $t \in \mathbb{R}^3$. Between uncalibrated and
calibrated cameras, there are also partially calibrated camera models; see [80, Chapter 6].
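
To make the camera model concrete, here is a small sketch that applies a $3 \times 4$ matrix to a point in homogeneous coordinates and then passes to the affine chart of the image plane. The helper names `project` and `dehomogenize`, and the specific rotation and translation, are illustrative choices of ours.

```python
from fractions import Fraction

def project(A, x):
    """Apply a 3x4 camera matrix A to homogeneous coordinates x of a 3D point."""
    return [sum(A[i][j] * x[j] for j in range(4)) for i in range(3)]

def dehomogenize(y):
    """Affine image coordinates in the chart where the last coordinate equals 1."""
    return (Fraction(y[0], y[2]), Fraction(y[1], y[2]))

# Standard camera A = [I_3 | 0].
standard = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]

# A calibrated camera [R | t]: rotation by 90 degrees about the z-axis, plus a shift.
R = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]
t = [0, 0, 2]
calibrated = [R[i] + [t[i]] for i in range(3)]

x = [2, 4, 2, 1]                               # the point (2, 4, 2) in 3-space
print(dehomogenize(project(standard, x)))      # (x/z, y/z) = (1, 2)
print(dehomogenize(project(calibrated, x)))    # image point (-1, 1/2)
```

Exact rational arithmetic is used so that the images can be compared without rounding. The sketch fails with a division by zero when the point is mapped to a point at infinity of the image plane, which includes the camera center.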
For 3D reconstruction, one needs at least two cameras. The joint camera map is

$$\Phi : \mathcal{C}_m \times \mathcal{X} \dashrightarrow \mathcal{Y}. \tag{13.1}$$


© The Author(s) 2024
P. Breiding et al., Metric Algebraic Geometry, Oberwolfach Seminars 53, 
https://doi.org/10.1007/978-3-031-51462-3_13 


Fig. 13.1: Calibrated cameras: in standard position (on the left) and arbitrary (on the right).


This map takes an $m$-tuple of cameras $(C_1, \ldots, C_m) \in \mathcal{C}_m$ and an algebraic 3D object
$x \in \mathcal{X}$ to the $m$ images $(C_1(x), \ldots, C_m(x)) \in \mathcal{Y}$. Here, $\mathcal{X}$ and $\mathcal{Y}$ are appropriate spaces
of objects and their images, respectively. For instance, the 3D object $x$ could be a single
point (i.e., $\mathcal{X} = \mathbb{P}^3$), a line (i.e., $\mathcal{X} = \mathrm{Gr}(1, \mathbb{P}^3)$), an arrangement of points and lines with
prescribed incidences, a curve, or a surface. The image space $\mathcal{Y}$ parametrizes $m$-tuples of
points, lines, or other arrangements in $\mathbb{P}^2$. For instance, if $\mathcal{X} = \mathbb{P}^3$ then $\mathcal{Y} = (\mathbb{P}^2)^m$.

Formally, the task of 3D reconstruction from unknown cameras means to compute fibers 
under the joint camera map $\Phi$. Thus, there are unknown objects in 3-space and $m$ unknown
cameras. We get to see the planar projections of the objects. The task is to find the original 
objects and the cameras with which the images were taken. 


Example 13.1 For $m = 2$ projective cameras observing $k$ points, the joint camera map is

$$\Phi : (\mathbb{P}\,\mathbb{R}^{3 \times 4})^2 \times (\mathbb{P}^3)^k \dashrightarrow (\mathbb{P}^2)^k \times (\mathbb{P}^2)^k, \qquad (A_1, A_2, x_1, \ldots, x_k) \mapsto (A_1 x_1, \ldots, A_1 x_k, A_2 x_1, \ldots, A_2 x_k). \tag{13.2}$$

It is defined whenever none of the 3D points $x_j$ is the kernel of $A_1$ or $A_2$. The kernel of the
matrix $A_i$ is the camera center of $C_i$. We note that $\Phi$ maps a space of dimension $22 + 3k$
to a space of dimension $4k$. The general non-empty fiber has dimension at least 15. This
is given by transforming $A_i \mapsto A_i g$ and $x_j \mapsto g^{-1} x_j$ for some $g \in \mathrm{PGL}(4)$. For large $k$,
the image of $\Phi$ is a proper subvariety of its codomain. For small $k$, the map $\Phi$ is dominant.
The transition occurs for $k = 7$. Indeed, 7 is the largest $k$ such that the map $\Phi$ is dominant.
We will study this issue of dominance transition in Section 13.3.

For calibrated cameras, given by matrices $A_i = [\, R_i \mid t_i \,]$, the joint camera map becomes

$$\Phi : (\mathrm{SO}(3) \times \mathbb{R}^3)^2 \times (\mathbb{P}^3)^k \dashrightarrow (\mathbb{P}^2)^k \times (\mathbb{P}^2)^k. \tag{13.3}$$

Here, the domain has dimension $12 + 3k$. The dominance transition occurs at $k = 5$. ⋄
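
The transition values 7 and 5 in Example 13.1 are consistent with a simple dimension count: the domain dimension, minus the dimension of the group acting on the fibers, must be at least the dimension $4k$ of the codomain. The sketch below performs only this necessary count and does not prove dominance. The group dimensions 15 (for $\mathrm{PGL}(4)$) and 7 (for the scaled special Euclidean group used later in this chapter) are inputs, and the function name is ours.

```python
def largest_k_by_dimension_count(camera_dim, group_dim, m=2, k_max=50):
    """Largest k with dim(domain) - dim(group) >= dim(codomain).

    camera_dim: total dimension of the space of m cameras
    group_dim:  dimension of the group acting on the fibers
    This is a necessary condition for dominance only.
    """
    best = None
    for k in range(1, k_max + 1):
        domain = camera_dim + 3 * k     # cameras plus k points in P^3
        codomain = 2 * m * k            # m images of k points in P^2
        if domain - group_dim >= codomain:
            best = k
    return best

print(largest_k_by_dimension_count(22, 15))   # two projective cameras: k = 7
print(largest_k_by_dimension_count(12, 7))    # two calibrated cameras: k = 5
```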
We now change the problem, and we assume that the cameras are fixed and known.
We are thus fixing a camera configuration $C = (C_1, \ldots, C_m) \in \mathcal{C}_m$, and we consider the
specialization $\Phi_C : \mathcal{X} \dashrightarrow \mathcal{Y}$ of the joint camera map. The Zariski closure of the image of
that map is the multiview variety $\mathcal{M}_C$ of the cameras $C$.

In what follows, we focus on the multiview variety of a single point in 3-space. This
means that $\mathcal{X} = \mathbb{P}^3$ and $\mathcal{Y} = (\mathbb{P}^2)^m$. In this case, the multiview variety $\mathcal{M}_C$ is the closure of
the image of the map given by multiplying the same vector with $m$ distinct matrices:

$$\Phi_C : \mathbb{P}^3 \dashrightarrow (\mathbb{P}^2)^m, \quad x \mapsto (A_1 x, A_2 x, \ldots, A_m x). \tag{13.4}$$

This map is well-defined at all points of $\mathbb{P}^3$ except for the camera centers $\ker(A_i)$. If $m \geq 2$
and not all camera centers coincide, then $\mathcal{M}_C$ is a threefold. For an algebraic geometer,
$\mathcal{M}_C$ is the threefold obtained from $\mathbb{P}^3$ by blowing up the camera centers.

From a computational point of view, the first question is to find the implicit description
of $\mathcal{M}_C$ as a subvariety of $(\mathbb{P}^2)^m$. Writing $(y_1, \ldots, y_m)$ for the $3m$ coordinates of $(\mathbb{P}^2)^m$, we
seek multihomogeneous polynomials in the $y_i$ whose common zero set is equal to $\mathcal{M}_C$.

The desired implicit description is given by the $3m \times (m + 4)$ matrix

$$M_A := \begin{bmatrix} A_1 & y_1 & 0 & \cdots & 0 \\ A_2 & 0 & y_2 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ A_m & 0 & 0 & \cdots & y_m \end{bmatrix}. \tag{13.5}$$

Here, $A_i$ is the $3 \times 4$ matrix that defines the camera $C_i$, and $y_i$ is an unknown column vector
of length 3, serving as homogeneous coordinates for the $i$th image plane $\mathbb{P}^2$. The following
result on the multiview variety for a single point goes back to [88]. For recent ideal-theoretic
versions of Proposition 13.2, see [104, Theorem 2] and the references therein.


Proposition 13.2 The multiview variety for $m$ cameras with at least two distinct centers is

$$\mathcal{M}_C = \{ (y_1, \ldots, y_m) \in (\mathbb{P}^2)^m \mid \operatorname{rank} M_A(y_1, \ldots, y_m) < m + 4 \}.$$

Proof (Idea) A tuple $(y_1, \ldots, y_m) \in (\mathbb{P}^2)^m$ is in the image of (13.4) if and only if there
exists $x \in \mathbb{P}^3$ and non-zero scalars $\lambda_i$ such that $A_i x = \lambda_i y_i$ for $i = 1, \ldots, m$. The latter
condition is equivalent to the vector $(x^\top, -\lambda_1, \ldots, -\lambda_m)^\top$ being in the kernel of the matrix
$M_A$ in (13.5). Taking the closure, we obtain that the multiview variety $\mathcal{M}_C$ consists of
those tuples $(y_1, \ldots, y_m)$ such that $M_A$ has a non-zero kernel. □


Example 13.3 For $m = 2$ cameras, the matrix in (13.5) has format $6 \times 6$, and it equals

$$M_A = \begin{bmatrix} A_1 & y_1 & 0 \\ A_2 & 0 & y_2 \end{bmatrix}.$$

The multiview variety $\mathcal{M}_C$ is the hypersurface in $\mathbb{P}^2 \times \mathbb{P}^2$ defined by $\det(M_A) = 0$. This
determinant is a bilinear form. Hence, there exists a $3 \times 3$ matrix $F$ such that

$$\det(M_A) = y_2^\top F y_1. \tag{13.6}$$

Note that the entries of $F$ depend on the matrices $A_i$. The matrix $F$ is called the fundamental
matrix. A computation verifies the following explicit formula:

$$F = \begin{bmatrix} [2356] & -[1356] & [1256] \\ -[2346] & [1346] & -[1246] \\ [2345] & -[1345] & [1245] \end{bmatrix}. \tag{13.7}$$

Here, $[ijkl]$ is the determinant of the $4 \times 4$ submatrix with row indices $i, j, k, l$ of the
$6 \times 4$ matrix $[\, A_1^\top \; A_2^\top \,]^\top$. The matrix $F$ has rank two. In fact, its right kernel is the image of
the camera center of $C_2$ under the camera $C_1$. Similarly, the left kernel of $F$ is the image
of the camera center of $C_1$ under the camera $C_2$. A geometric interpretation is shown in
Figure 13.2. The two cameras are projections onto the two planes. ⋄
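
The statements of Example 13.3 can be checked on a concrete pair of cameras. The sketch below builds $F$ from the $4 \times 4$ minors in (13.7), compares $y_2^\top F y_1$ with the determinant of the $6 \times 6$ matrix, and tests the two kernels. The camera matrices and image points are arbitrary integer choices of ours, and the determinant routine is a plain Leibniz expansion that is adequate only for such small sizes.

```python
from itertools import permutations

def det(M):
    """Exact determinant via the Leibniz formula (fine for sizes up to 6)."""
    n, total = len(M), 0
    for perm in permutations(range(n)):
        inversions = sum(perm[i] > perm[j] for i in range(n) for j in range(i + 1, n))
        term = -1 if inversions % 2 else 1
        for i in range(n):
            term *= M[i][perm[i]]
        total += term
    return total

def fundamental_matrix(A1, A2):
    """Fundamental matrix via (13.7); F[j][k] pairs y2[j] with y1[k]."""
    B = A1 + A2                                   # rows 1..6 of the stacked 6 x 4 matrix
    F = [[0] * 3 for _ in range(3)]
    for j in range(3):
        for k in range(3):
            rows = [r for r in range(6) if r not in (k, j + 3)]
            F[j][k] = (-1) ** (j + k) * det([B[r] for r in rows])
    return F

def multiview_matrix(A1, A2, y1, y2):
    """The 6 x 6 matrix M_A of Example 13.3."""
    return ([A1[i] + [y1[i], 0] for i in range(3)]
            + [A2[i] + [0, y2[i]] for i in range(3)])

def center(A):
    """Kernel vector of a full-rank 3 x 4 matrix, from its signed 3 x 3 minors."""
    return [(-1) ** j * det([[row[c] for c in range(4) if c != j] for row in A])
            for j in range(4)]

def apply(A, x):
    return [sum(A[i][j] * x[j] for j in range(4)) for i in range(3)]

A1 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
A2 = [[1, 2, 0, 1], [0, 1, 3, 2], [1, 0, 1, 5]]
F = fundamental_matrix(A1, A2)

y1, y2 = [1, 2, 3], [4, -1, 2]
bilinear = sum(y2[j] * F[j][k] * y1[k] for j in range(3) for k in range(3))
print(det(multiview_matrix(A1, A2, y1, y2)) == bilinear)            # identity (13.6)

e1 = apply(A1, center(A2))     # image of the center of camera 2 under camera 1
e2 = apply(A2, center(A1))     # image of the center of camera 1 under camera 2
print([sum(F[j][k] * e1[k] for k in range(3)) for j in range(3)])   # F e1 = 0
print([sum(e2[j] * F[j][k] for j in range(3)) for k in range(3)])   # e2^T F = 0
```

Since both kernels are non-zero, the determinant of $F$ vanishes for these cameras, in line with the rank statement of the example.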


Fig. 13.2: The image of the center of one camera under the other camera (red points) is the kernel of the 
fundamental matrix. 


In computer vision, the term triangulation is used quite differently from the usage of that 
term in geometry and topology. Here, triangulation refers to the task of 3D reconstruction, 
assuming the configuration C of cameras is known. This task amounts to computing fibers 
under the specialized joint camera map $\Phi_C$.

Recall that, for $m \geq 2$ cameras, the generic fiber under $\Phi_C$ in (13.4) is empty. The
measurements $y := (y_1, \ldots, y_m)$ on the images are typically noisy. They do not lie on
the multiview variety $\mathcal{M}_C$, but hopefully close to it. To triangulate a corresponding space
point $x \in \mathbb{P}^3$, one seeks to compute the point $\tilde{y}$ on $\mathcal{M}_C$ that is closest to $y$, and then identify
its fiber $\Phi_C^{-1}(\tilde{y})$. Hence, the algebraic complexity of triangulating a single point from $m$
images is determined by the ED degree of the multiview variety $\mathcal{M}_C$.

In real-life applications, the image measurements $(y_1, \ldots, y_m)$ come from an affine
chart $(\mathbb{R}^2)^m$ inside $(\mathbb{P}^2)^m$. In that chart, we are working with the affine multiview variety

$$\mathcal{M}_C^\circ := \mathcal{M}_C \cap (\mathbb{R}^2)^m.$$

Triangulation thus means solving the ED problem (2.1) for the affine variety $\mathcal{M}_C^\circ$ in $\mathbb{R}^{2m}$.
The algebraic complexity of that problem is the ED degree of the affine multiview
variety $\mathcal{M}_C^\circ$. The interest in that ED degree originated in the computer vision community,
namely in the articles [79] and [162] titled How hard is 3-view triangulation really?

The computer vision experts Stewénius and Nistér computed the ED degree of $\mathcal{M}_C^\circ$ for
$m \leq 7$ cameras [77]. Using Gröbner bases, one finds

$$\mathrm{EDdegree}(\mathcal{M}_C^\circ) = 6, 47, 148, 336, 638, 1081 \quad \text{for } m = 2, 3, 4, 5, 6, 7. \tag{13.8}$$

That computation was the original motivation for the development of the general notion of
ED degrees of algebraic varieties. It started the five-author collaboration on the article [60].
Based on the numbers (13.8), the following cubic polynomial for the ED degree of the
multiview variety $\mathcal{M}_C^\circ$ for $m$ cameras was conjectured in [60, Example 3.3]. That conjecture
was proven by Maxim, Rodriguez, and Wang in [128]. Their result is as follows:


Theorem 13.4 The ED degree of the affine multiview variety $\mathcal{M}_C^\circ$ for $m \geq 2$ cameras in
general position is

$$\frac{9}{2} m^3 - \frac{21}{2} m^2 + 8m - 4.$$
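
As a quick consistency check, the cubic $\frac{9}{2} m^3 - \frac{21}{2} m^2 + 8m - 4$ of Theorem 13.4 can be evaluated exactly and compared with the six values listed in (13.8). The function name below is ours.

```python
from fractions import Fraction

def ed_degree_multiview(m):
    """Evaluate (9/2) m^3 - (21/2) m^2 + 8 m - 4 exactly."""
    value = Fraction(9, 2) * m**3 - Fraction(21, 2) * m**2 + 8 * m - 4
    assert value.denominator == 1      # the cubic is integer-valued at integers
    return int(value)

print([ed_degree_multiview(m) for m in range(2, 8)])
# agrees with 6, 47, 148, 336, 638, 1081 from (13.8)
```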

We close this section by explaining the Gröbner basis computation that solves the
triangulation problem for small $m$. The $m$ cameras are given by $m$ matrices $A_i = (a_{ijk})$
of format $3 \times 4$. In each image plane $\mathbb{P}^2$, we choose the affine chart by setting the last
coordinate to 1. Our data are points $(y_{i1}, y_{i2}, 1)$ in these planes for $i = 1, 2, \ldots, m$. We
similarly fix affine coordinates for the unknown point $x = (x_1, x_2, x_3, 1)$ in the space $\mathbb{P}^3$.
Consider the squared Euclidean distance to a point $\Phi_C(x)$ on $\mathcal{M}_C^\circ$:

$$\Delta(x) = \sum_{i=1}^{m} \sum_{j=1}^{2} \left( y_{ij} - \frac{a_{ij1} x_1 + a_{ij2} x_2 + a_{ij3} x_3 + a_{ij4}}{a_{i31} x_1 + a_{i32} x_2 + a_{i33} x_3 + a_{i34}} \right)^{2}.$$

The optimization problem (2.1) asks us to compute the point $x$ in $\mathbb{R}^3$ that minimizes the
distance $\Delta(x)$. To model this problem algebraically, we consider the common denominator
of our objective function:

$$\mathrm{denom}(x) := \prod_{i=1}^{m} (a_{i31} x_1 + a_{i32} x_2 + a_{i33} x_3 + a_{i34}).$$

We introduce a new variable $z$ with $z \cdot \mathrm{denom}(x) = 1$. This ensures that all denominators
are non-zero. Let $I$ denote the ideal in the polynomial ring $\mathbb{R}[x_1, x_2, x_3, z]$ that is generated
by $z \cdot \mathrm{denom}(x) - 1$ and the numerators of the partial derivatives $\partial \Delta(x) / \partial x_k$ for $k = 1, 2, 3$.
Thus, $I$ represents a system of four polynomial equations in four unknowns. The complex
variety $V(I)$ is finite, and its points are precisely the complex critical points of the objective
function $\Delta(x)$ in the open set given by $\mathrm{denom}(x) \neq 0$. We compute a Gröbner basis of $I$ and
we consider the set $\mathcal{B}$ of standard monomials. By Proposition 3.7, we have $\#\mathcal{B} = \#V(I)$.
That number equals $6, 47, 148, \ldots$ for $m = 2, 3, 4, \ldots$, as promised in Theorem 13.4.
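
The objective function of the triangulation problem is easy to write down explicitly. The following sketch evaluates the squared distance $\Delta(x)$ and the common denominator $\mathrm{denom}(x)$ in exact rational arithmetic for two hypothetical cameras of our choosing. It does not perform the Gröbner basis computation, which requires a computer algebra system.

```python
from fractions import Fraction

def project_affine(A, x):
    """Image of the affine point x = (x1, x2, x3) in the chart with last coordinate 1."""
    xh = list(x) + [1]
    u, v, w = (sum(A[r][c] * xh[c] for c in range(4)) for r in range(3))
    return (Fraction(u, w), Fraction(v, w))   # fails with ZeroDivisionError if w == 0

def delta(cameras, ys, x):
    """Squared Euclidean distance between the data ys and the images of x."""
    total = Fraction(0)
    for A, y in zip(cameras, ys):
        p = project_affine(A, x)
        total += (y[0] - p[0]) ** 2 + (y[1] - p[1]) ** 2
    return total

def denom(cameras, x):
    """Product over all cameras of the third coordinate of A_i x."""
    xh = list(x) + [1]
    product = 1
    for A in cameras:
        product *= sum(A[2][c] * xh[c] for c in range(4))
    return product

# Two hypothetical cameras with integer entries.
cameras = [
    [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 1]],
    [[0, 1, 0, 1], [0, 0, 1, 0], [1, 0, 0, 2]],
]
x_true = (1, 2, 3)
ys = [project_affine(A, x_true) for A in cameras]   # noiseless measurements

print(delta(cameras, ys, x_true))      # 0 at the true point
print(delta(cameras, ys, (1, 2, 4)))   # 89/720 at a perturbed point
print(denom(cameras, x_true))          # 12, so all denominators are non-zero
```

With noisy data `ys`, the minimum of `delta` is positive, and its critical points are the solutions of the polynomial system described above.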


Example 13.5 We consider the dinosaur data set from the Visual Geometry Group in the
Department of Engineering Science at the University of Oxford. This data set contains 36
cameras as $3 \times 4$ matrices and a list of pictures of 4983 3D points taken by these
36 cameras. The data is shown in Figure 13.3. We solve triangulation for $m = 2$ randomly
chosen cameras as described above, solving the resulting polynomial system with
HomotopyContinuation.jl [31]. The result can be seen in Figure 13.4. ⋄

Fig. 13.4: The 4983 3D points that have been triangulated from the dinosaur data set in Example 13.5.


13.2 Grassmann Tensors


The fundamental matrix $F$ from Example 13.3 is a projectively invariant representation of
two cameras; see Proposition 13.12. Note that $F$ has seven degrees of freedom since $F$ has
rank two and is defined up to scale. This number makes sense because each matrix $A_i$ has 11
degrees of freedom, up to scale, and we are taking the quotient modulo the 15-dimensional
group $\mathrm{PGL}(4)$. Thus, the dimension of the quotient space is $2 \cdot 11 - 15 = 7$.

In this section, we introduce a class of tensors that generalize the fundamental matrix.
These give projectively invariant representations for $m \geq 2$ cameras. We now allow
higher-dimensional projections

$$C_i : \mathbb{P}^N \dashrightarrow \mathbb{P}^{n_i}. \tag{13.9}$$

Such projections can be used to model basic dynamics [96, 178]. The camera center of the
camera $C_i$ is its base locus in $\mathbb{P}^N$. This is represented by the kernel of the corresponding
matrix $A_i$ with $N + 1$ columns. The approach we shall describe was introduced in computer
vision by Hartley and Schaffalitzky [78].

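As before, the multiview variety is the closure in $\mathbb{P}^{n_1} \times \cdots \times \mathbb{P}^{n_m}$ of the image of $\mathbb{P}^N$
under the $C_i$. We are interested in tuples of linear spaces $L_i \subset \mathbb{P}^{n_i}$ whose product meets the
multiview variety. To this end, we fix the cameras $C_i$ in (13.9), and we fix $c_i = \operatorname{codim}(L_i)$
with $1 \leq c_i \leq n_i$. Let $\Gamma_{C,c}$ be the closure of

$$\bigl\{ (L_1, \ldots, L_m) \in \mathrm{Gr}(n_1 - c_1, \mathbb{P}^{n_1}) \times \cdots \times \mathrm{Gr}(n_m - c_m, \mathbb{P}^{n_m}) \;\bigm|\; \exists\, x \in \mathbb{P}^N : C_1(x) \in L_1, \ldots, C_m(x) \in L_m \bigr\}. \tag{13.10}$$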

Note that $\Gamma_{C,c}$ is a subvariety in a product of Grassmannians. This projective variety fills
its ambient space when the $c_i$ are small. It has high codimension when the $c_i$ are large. The
following theorem concerns the sweet spot in the middle.


Theorem 13.6 For generic cameras $C_i$, the variety $\Gamma_{C,c}$ is a hypersurface if and only if
$c_1 + \cdots + c_m = N + 1$. In that case, its defining equation is multilinear in the Plücker
coordinates of the Grassmannians $\mathrm{Gr}(n_i - c_i, \mathbb{P}^{n_i})$.


Multilinear forms are represented by tensors, just like bilinear forms are represented
by matrices. The equation described in Theorem 13.6 is a multilinear form in $m$ sets
of unknowns, namely the Plücker coordinates of the Grassmannians $\mathrm{Gr}(n_i - c_i, \mathbb{P}^{n_i})$ for
$i = 1, \ldots, m$. The Grassmann tensor of the camera configuration $C$ is the order $m$ tensor
which represents the multilinear equation of the hypersurface $\Gamma_{C,c}$.

Before we come to the proof of Theorem 13.6, we present a census of all Grassmann
tensors in the case of primary interest in computer vision, namely $N = 3$ and $n_i = 2$
for all $i = 1, \ldots, m$. According to Theorem 13.6, a Grassmann tensor exists whenever
$c_1 + \cdots + c_m = 4$ and $1 \leq c_i \leq 2$ for $i = 1, \ldots, m$.
Example 13.7 ($m = 2$, $c_1 = c_2 = 2$) Here, the Grassmann tensor is the fundamental
matrix $F$. The ambient space in (13.10) is $(\mathrm{Gr}(2 - 2, \mathbb{P}^2))^2 = (\mathbb{P}^2)^2$ with coordinates
$(y_1, y_2)$, one for each of the two image points. The hypersurface $\Gamma_{(C_1, C_2), (2, 2)}$ is the
multiview variety $\mathcal{M}_{C_1, C_2}$ in Example 13.3. The bilinear form in (13.6) is represented by
the $3 \times 3$ matrix $F$. The formula (13.7) can be obtained by computing the determinant of
the $6 \times 6$ matrix $M_A$ using Laplace expansion with respect to the last two columns. ⋄


Example 13.8 ($m = 3$, $c_1 = 2$, $c_2 = c_3 = 1$) This Grassmann tensor has format $3 \times 3 \times 3$.
It is known as the trifocal tensor of three cameras. The ambient space in (13.10) is
$\mathrm{Gr}(2 - 2, \mathbb{P}^2) \times (\mathrm{Gr}(2 - 1, \mathbb{P}^2))^2 \cong (\mathbb{P}^2)^3$. The first plane $\mathbb{P}^2$ has coordinates $y_1$. The other
factors represent lines in $\mathbb{P}^2$, each spanned by two points $y_{i1}, y_{i2}$ for $i = 2, 3$. The Plücker
coordinates of the line $y_{i1} \vee y_{i2}$ are the $2 \times 2$ minors of the $3 \times 2$ matrix $[\, y_{i1} \; y_{i2} \,]$. The
determinant of the following $9 \times 9$ matrix is a trilinear form in $y_1$, $y_{21} \vee y_{22}$, and $y_{31} \vee y_{32}$:

$$\det \begin{bmatrix} A_1 & y_1 & 0 & 0 & 0 & 0 \\ A_2 & 0 & y_{21} & y_{22} & 0 & 0 \\ A_3 & 0 & 0 & 0 & y_{31} & y_{32} \end{bmatrix}.$$

There are actually three trifocal tensors, one for each choice for the triple of codimensions
$(c_1, c_2, c_3) \in \{(2, 1, 1), (1, 2, 1), (1, 1, 2)\}$. The variety of all trifocal tensors, for one such
fixed choice, was studied by Aholt and Oeding in [3]. ⋄


Example 13.9 ($m = 4$, $c_1 = c_2 = c_3 = c_4 = 1$) Here, we get the quadrifocal tensor of
format $3 \times 3 \times 3 \times 3$. It represents a multilinear form on four sets of lines $y_{i1} \vee y_{i2}$, namely
the determinant of the $12 \times 12$ matrix

$$\det \begin{bmatrix} A_1 & y_{11} & y_{12} & 0 & 0 & 0 & 0 & 0 & 0 \\ A_2 & 0 & 0 & y_{21} & y_{22} & 0 & 0 & 0 & 0 \\ A_3 & 0 & 0 & 0 & 0 & y_{31} & y_{32} & 0 & 0 \\ A_4 & 0 & 0 & 0 & 0 & 0 & 0 & y_{41} & y_{42} \end{bmatrix}.$$

The variety of all quadrifocal tensors was studied by Oeding in [139]. ⋄
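
The census in Examples 13.7 to 13.9 can be enumerated mechanically from the condition $c_1 + \cdots + c_m = N + 1$ with $1 \leq c_i \leq n_i$. The sketch below assumes that all image spaces have the same dimension $n$, and it uses the fact that $\mathrm{Gr}(n - c_i, \mathbb{P}^n)$ has $\binom{n+1}{c_i}$ Plücker coordinates to report the tensor format. The function name is ours.

```python
from itertools import product
from math import comb

def grassmann_tensor_census(N=3, n=2):
    """All (m, codimensions, tensor format) with c_1 + ... + c_m = N + 1, 1 <= c_i <= n."""
    census = []
    for m in range(2, N + 2):              # m <= N + 1 because every c_i is at least 1
        for c in product(range(1, n + 1), repeat=m):
            if sum(c) == N + 1:
                tensor_format = tuple(comb(n + 1, ci) for ci in c)
                census.append((m, c, tensor_format))
    return census

for m, c, tensor_format in grassmann_tensor_census():
    print(m, c, tensor_format)
```

For $N = 3$ and $n = 2$ this lists one tensor of format $3 \times 3$, three of format $3 \times 3 \times 3$, and one of format $3 \times 3 \times 3 \times 3$.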
Proof (of Theorem 13.6) This result appears in [78]. We give an independent proof. For
general $L_i \in \mathrm{Gr}(n_i - c_i, \mathbb{P}^{n_i})$, the back-projected space $\tilde{L}_i := C_i^{-1}(L_i)$ also has
codimension $c_i$, i.e., $\tilde{L}_i \in \mathrm{Gr}(N - c_i, \mathbb{P}^N)$. If $\sum c_i \leq N$, the back-projected spaces
$\tilde{L}_1, \ldots, \tilde{L}_m$ always intersect, meaning that $\Gamma_{C,c}$ is equal to its ambient space.

Now suppose $\sum c_i = N + 1$. Due to the genericity of the cameras, the multiview variety
$\mathcal{M}_C$ of a single point (i.e., the Zariski closure of the image of $\Phi_C : \mathbb{P}^N \dashrightarrow \mathbb{P}^{n_1} \times \cdots \times \mathbb{P}^{n_m}$)
has dimension $N$. Hence, the product $L_1 \times \cdots \times L_m$ of general subspaces $L_i \subset \mathbb{P}^{n_i}$ of
codimension $c_i$ does not intersect the multiview variety $\mathcal{M}_C$. Therefore, $\Gamma_{C,c}$ is a proper
subvariety. We show that it is a hypersurface and we simultaneously compute its multidegree
by intersecting it with generic pencils in $\mathrm{Gr}(n_1 - c_1, \mathbb{P}^{n_1}) \times \cdots \times \mathrm{Gr}(n_m - c_m, \mathbb{P}^{n_m})$.

It suffices to show that every general pencil meets $\Gamma_{C,c}$ in exactly one point. Such a
pencil is of the form $L_1 \times \cdots \times \mathcal{L}_k \times \cdots \times L_m$, where $L_i$ (for $i \neq k$) is a general point in
the Grassmannian $\mathrm{Gr}(n_i - c_i, \mathbb{P}^{n_i})$, and

$$\mathcal{L}_k = \{ L \in \mathrm{Gr}(n_k - c_k, \mathbb{P}^{n_k}) \mid V_k \subset L \subset W_k \},$$

where $V_k, W_k \subset \mathbb{P}^{n_k}$ are general subspaces of codimension $c_k + 1$ and $c_k - 1$, respectively.
The back-projected spaces $\tilde{L}_1, \ldots, \tilde{W}_k, \ldots, \tilde{L}_m$ intersect in a single point $x \in \mathbb{P}^N$. That
point $x$ does not lie in the back-projected space $\tilde{V}_k$. Otherwise, for general $L \in \mathcal{L}_k$, the point
$L_1 \times \cdots \times L \times \cdots \times L_m$ would be in $\Gamma_{C,c}$. This is a contradiction since the codimension of $\Gamma_{C,c}$
is at least one. Thus, there is exactly one $L \in \mathcal{L}_k$ (namely, the span of $V_k$ with $C_k(x)$) which
satisfies $L_1 \times \cdots \times L \times \cdots \times L_m \in \Gamma_{C,c}$. This shows that $\Gamma_{C,c}$ is a hypersurface of multidegree
$(1, \ldots, 1)$. The multidegree of a hypersurface in a product of Grassmannians equals the
multidegree of its equation in Plücker coordinates [71, Chapter 3, Proposition 2.1]. Similar
arguments show that $\Gamma_{C,c}$ has codimension $\geq 2$ whenever $\sum c_i > N + 1$. □


By varying the Grassmann tensors over all camera configurations, we obtain a variety
that parametrizes camera configurations modulo projective transformations; see
Proposition 13.12. In the classical case of cameras observing 3-space ($N = 3$), this quotient space has dimension
$11m - 15$. This approach yields the variety of fundamental matrices for $m = 2$, the trifocal
variety [3] for $m = 3$, and the quadrifocal variety [139] for $m = 4$. For $m \geq 5$, we can form
quotient varieties by taking appropriate collections of fundamental matrices, trifocal
tensors, and quadrifocal tensors. A more systematic representation is furnished by the Hilbert
schemes, which were studied by Aholt, Sturmfels, and Thomas in [4].


13.3 3D Reconstruction from Unknown Cameras 


We now turn to the problem of 3D reconstruction from images taken by unknown cameras. 
This amounts to computing fibers under the joint camera map $\Phi$ in (13.1). Typically,
a nontrivial group $G$ acts on the fibers of $\Phi$ since global 3D transformations that act
simultaneously on the cameras and the 3D scene do not change the resulting images. Most 
real-life applications assume that the cameras are calibrated, so we are naturally led to the 
rotation group SO(3), which is an important player in metric algebraic geometry. 


Example 13.10 For a projective camera $A \in \mathbb{P}\,\mathbb{R}^{3 \times 4}$ that observes a point $x \in \mathbb{P}^3$, the
projective linear group $\mathrm{PGL}(4)$ acts via $g \mapsto (A g^{-1}, g x)$ on cameras and points without
changing the image $A g^{-1} \cdot g x = A x$. Hence, $\mathrm{PGL}(4)$ acts on the fibers of the joint camera
map in (13.2), where projective cameras observe $k$ points in $\mathbb{P}^3$. This means that 3D
reconstruction is only possible up to a projective transformation.

The action of the group $\mathrm{PGL}(4)$ does not map calibrated cameras to calibrated cameras.
The largest subgroup $G$ of $\mathrm{GL}(4)$ that preserves the structure of calibrated camera matrices
$[\, R \mid t \,] \in \mathrm{SO}(3) \times \mathbb{R}^3$ equals

$$G = \left\{ g \in \mathrm{GL}(4) \;\middle|\; g = \begin{bmatrix} R & t \\ 0 & \lambda \end{bmatrix} \text{ for some } R \in \mathrm{SO}(3),\; t \in \mathbb{R}^3,\; \lambda \in \mathbb{R} \setminus \{0\} \right\}. \tag{13.11}$$

This is the scaled special Euclidean group of $\mathbb{R}^3$. Paraphrasing for computer vision, we
conclude that 3D reconstruction with calibrated cameras is only possible up to a proper
rigid motion and a non-zero scale. ⋄
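
That the group $G$ preserves calibrated cameras is a one-line block computation: multiplying $[\, R_1 \mid t_1 \,]$ on the right by an element with blocks $R_2, t_2, \lambda$ gives $[\, R_1 R_2 \mid R_1 t_2 + \lambda t_1 \,]$. The sketch below confirms this on integer rotation matrices, which are chosen by us so that all checks are exact.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

def is_rotation(R):
    """Check R R^T = I and det R = 1 exactly."""
    Rt = [list(col) for col in zip(*R)]
    identity = [[int(i == j) for j in range(3)] for i in range(3)]
    d = (R[0][0] * (R[1][1] * R[2][2] - R[1][2] * R[2][1])
         - R[0][1] * (R[1][0] * R[2][2] - R[1][2] * R[2][0])
         + R[0][2] * (R[1][0] * R[2][1] - R[1][1] * R[2][0]))
    return matmul(R, Rt) == identity and d == 1

def group_element(R, t, lam):
    """The 4 x 4 matrix with blocks R, t in the top rows and (0, 0, 0, lam) below."""
    return [R[i] + [t[i]] for i in range(3)] + [[0, 0, 0, lam]]

R1, t1 = [[0, -1, 0], [1, 0, 0], [0, 0, 1]], [1, 2, 3]      # calibrated camera [R1 | t1]
R2, t2, lam = [[1, 0, 0], [0, 0, -1], [0, 1, 0]], [4, 5, 6], 2

camera = [R1[i] + [t1[i]] for i in range(3)]
new_camera = matmul(camera, group_element(R2, t2, lam))
new_R = [row[:3] for row in new_camera]
new_t = [row[3] for row in new_camera]

print(is_rotation(new_R))   # True: the left block is again in SO(3)
print(new_t)                # R1 t2 + lam * t1 = [-3, 8, 12]
```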


The state-of-the-art 3D reconstruction algorithms work with minimal problems, where
the joint camera map $\Phi$ is dominant and its fibers are generically finite modulo the group $G$.
For minimal problems, computing the fibers of $\Phi$ means solving a square parametrized
system of polynomial equations. Such 3D reconstruction problems use the minimal amount
of data on the images while having finitely many solutions. The algebraic degree of a
minimal problem is the number of complex solutions for generic data.


Proposition 13.11 Consider 3D reconstruction for $m \geq 2$ unknown cameras observing $k$
points. For projective cameras, there are two minimal problems, namely $(m, k) = (2, 7)$
and $(m, k) = (3, 6)$, both of algebraic degree 3. For calibrated cameras, the only minimal
problem is $(m, k) = (2, 5)$, with an algebraic degree of 20.


Proof For $m \geq 2$ projective cameras and $k$ points, the joint camera map $\Phi$ takes the quotient
space $((\mathbb{P}\,\mathbb{R}^{3 \times 4})^m \times (\mathbb{P}^3)^k) / \mathrm{PGL}(4)$ to the space of images $((\mathbb{P}^2)^k)^m$. (A comment for
experts: the quotient is meant in the sense of geometric invariant theory, but we here only
need a birational model.) If the 3D reconstruction problem is minimal, then both spaces
have the same dimension, which gives $11m + 3k - 15 = 2km$. Solving for $k$ gives
$k = 5 + m/(2m - 3)$, which is an integer only for $m = 2$ and $m = 3$. Hence this equation has precisely
two integer solutions with $m \geq 2$ and $k \geq 1$, namely $(m, k) = (2, 7)$ and $(m, k) = (3, 6)$. Both
yield minimal problems of algebraic degree three [76]. These degrees were found in the
19th century by Hesse [87] and Sturm [163].

For calibrated cameras instead of projective cameras, the domain of the joint camera
map $\Phi$ changes to $((\mathrm{SO}(3) \times \mathbb{R}^3)^m \times (\mathbb{P}^3)^k) / G$, where $G$ is the group in (13.11). Now, the
two spaces have the same dimension if and only if $6m + 3k - 7 = 2km$. The only relevant
integer solution is $(m, k) = (2, 5)$. This yields a minimal problem of algebraic degree 20.
This problem has the label 50002 in the census of minimal problems found in [64, Table 1].
It is the minimal problem most widely used in practical 3D reconstruction algorithms. □
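
The two dimension equations in the proof, $11m + 3k - 15 = 2km$ and $6m + 3k - 7 = 2km$, can be solved by brute force over a finite range. The sketch below does this; the search bounds are arbitrary choices of ours, and the enumeration is only a sanity check of the balanced cases, not a proof of minimality.

```python
def balanced_pairs(camera_dim, group_dim, m_max=50, k_max=50):
    """Pairs (m, k), m >= 2, k >= 1, with camera_dim * m + 3k - group_dim = 2km."""
    return [(m, k)
            for m in range(2, m_max + 1)
            for k in range(1, k_max + 1)
            if camera_dim * m + 3 * k - group_dim == 2 * k * m]

print(balanced_pairs(11, 15))   # projective cameras: [(2, 7), (3, 6)]
print(balanced_pairs(6, 7))     # calibrated cameras: [(2, 5)]
```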


Later, we shall take a closer look at the systems of polynomial equations arising from 
Proposition 13.11. First, however, we explain the practical usage of minimal problems for 
3D reconstruction in computer vision. Imagine two calibrated cameras that observe 100 
points. Then, the joint camera map $\Phi$ in (13.3) for $k = 100$ is not dominant. Hence, the
fiber under $\Phi$ of two noisy images is empty. We would need to find a closest point on the
image of $\Phi$, before we can compute any fiber. In other words, we must first solve an ED
problem like that in Theorem 13.4. But this is now even harder because the cameras are 
unknown. In addition, in practical scenarios, it often happens that some of the given point 
pairs are outliers. Then, solving the ED problem yields incorrect solutions. 

In practice, one simply avoids any such ED problem. Instead, one chooses five of the 
100 given point pairs and solves the associated minimal problem in Proposition 13.11. 
This process gets repeated many times. The solutions to the many minimal problems get 
patched together via random sample consensus (RANSAC), until all 100 points and both 
cameras are reconstructed. For more details, see [104]. 
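
The sampling loop of RANSAC can be illustrated on a toy problem. In the sketch below, fitting a line through two points plays the role of the minimal problem; this is a deliberate stand-in for the five-point problem, whose solver is beyond a short example. The data, the tolerance, and the function names are all hypothetical.

```python
import random

def fit_line(p, q):
    """Implicit line a*x + b*y + c = 0 through two distinct points."""
    a, b = q[1] - p[1], p[0] - q[0]
    return a, b, -(a * p[0] + b * p[1])

def inliers(line, points, tol):
    """Points whose Euclidean distance to the line is at most tol."""
    a, b, c = line
    norm = (a * a + b * b) ** 0.5
    return [pt for pt in points if abs(a * pt[0] + b * pt[1] + c) <= tol * norm]

def ransac(points, iterations=200, tol=0.5, seed=0):
    """Repeatedly solve the minimal problem and keep the model with most support."""
    rng = random.Random(seed)
    best_line, best_support = None, []
    for _ in range(iterations):
        p, q = rng.sample(points, 2)        # minimal sample: two points determine a line
        candidate = fit_line(p, q)
        support = inliers(candidate, points, tol)
        if len(support) > len(best_support):
            best_line, best_support = candidate, support
    return best_line, best_support

good = [(x, 2 * x + 1) for x in range(20)]              # points on the line y = 2x + 1
bad = [(1, 30), (5, -7), (9, 40), (14, 0), (17, 60)]    # outliers
line, support = ransac(good + bad)
print(len(support))   # number of points explained by the best model
```

The outliers never enter a least-squares fit here: each candidate model is computed from a minimal sample, and the remaining data are used only to vote. This is the reason why fast solvers for minimal problems matter in practice.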

Since minimal problems must be solved many times for RANSAC, it is crucial that their 
formulation as a square polynomial system is efficient. A common strategy for simplification 
is to first reconstruct the cameras only and afterward recover the 3D scene via triangulation. 
Hence, instead of solving the full minimal problem at once, we start with a polynomial 
system whose only unknowns are the camera parameters. 

To parametrize the $m$ cameras modulo the group $G$, we use the Grassmann tensors. We
work in arbitrary dimensions, as in (13.10), starting with $G = \mathrm{PGL}(N + 1)$. We consider
$m$ surjective projections $C_i : \mathbb{P}^N \dashrightarrow \mathbb{P}^{n_i}$ and integers $c_1, \ldots, c_m$ that satisfy $1 \leq c_i \leq n_i$
and $c_1 + \cdots + c_m = N + 1$. The Grassmann tensor $T_{C,c}$ exists, by Theorem 13.6. Namely, $T_{C,c}$
is parametrized by the rational map

$$\gamma_c : \mathbb{P}\,\mathbb{R}^{(n_1+1) \times (N+1)} \times \cdots \times \mathbb{P}\,\mathbb{R}^{(n_m+1) \times (N+1)} \dashrightarrow \mathbb{P}\,\mathbb{R}^{\binom{n_1+1}{c_1} \times \cdots \times \binom{n_m+1}{c_m}}.$$

This map sends the $m$ cameras $C_i$ to their Grassmann tensor, written in Plücker coordinates.

Proposition 13.12 ([78]) The group $\mathrm{PGL}(N + 1)$ acts on the fibers of $\gamma_c$ by componentwise
right-multiplication. The map $\gamma_c$ becomes birational on the quotient of its domain modulo
that group action.


Example 13.13 Consider the basic scenario in Example 13.7, where we have $N = 3$ and
$m = n_1 = n_2 = c_1 = c_2 = 2$. The map $\gamma_c$ sends two $3 \times 4$ matrices $A_1, A_2$ to the $3 \times 3$
matrix $F$ in (13.7). The Zariski closure of the image of $\gamma_c$ is the hypersurface $\mathcal{F} = V(\det)$
defined by the $3 \times 3$ determinant in $\mathbb{P}\,\mathbb{R}^{3 \times 3} = \mathbb{P}^8$. It is birational to pairs of projective cameras
modulo $\mathrm{PGL}(4)$. This quotient construction reduces the number of camera parameters from
22 to 7. Any pair of image points $(y_1, y_2) \in \mathbb{P}^2 \times \mathbb{P}^2$ imposes a linear condition on $F$,
namely $y_2^\top F y_1 = 0$ as in (13.6). Reconstructing a fundamental matrix means intersecting
the seven-dimensional variety $\mathcal{F}$ with seven hyperplanes. We conclude that observing seven
points with two projective cameras is a minimal problem of degree $\deg(\mathcal{F}) = 3$. ⋄


We now turn to pairs of calibrated cameras and the scaled special Euclidean group $G$
in (13.11). A real $3 \times 3$ matrix $E$ is called essential if it has rank two and the two non-zero
singular values are equal. The Zariski closure of the set of essential matrices is the essential
variety $\mathcal{E}$ in $\mathbb{P}\,\mathbb{R}^{3 \times 3} = \mathbb{P}^8$.


Theorem 13.14 The map $\gamma_c$ in Example 13.13 is two-to-one when restricted to pairs of
calibrated cameras modulo the group $G$. Its image is the essential variety $\mathcal{E}$. This variety
has dimension five, degree ten, and is defined by the following ten cubic equations in the
entries of an unknown $3 \times 3$ matrix $E$:

$$\det E = 0 \quad \text{and} \quad E E^\top E - \tfrac{1}{2} \operatorname{tr}(E E^\top) E = 0. \tag{13.12}$$


Proof This can be proved by direct computations. The set of essential matrices is a
semialgebraic set of codimension three. Demazure [54] showed that the equations (13.12) hold on
that set. With Macaulay2 [73], we can check that these ten homogeneous cubics generate
a prime ideal of codimension three. The fact that the map $\gamma_c$ is two-to-one can be verified
by a computation in the coordinates that are used in [64]. □
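
The cubic equations $\det E = 0$ and $E E^\top E - \tfrac{1}{2} \operatorname{tr}(E E^\top) E = 0$ can be tested on explicit essential matrices. The sketch below uses the standard construction $E = [t]_\times R$ from the computer vision literature, where $[t]_\times$ is the skew-symmetric matrix of the cross product with $t$. This parametrization is an assumption of the example and is not taken from the text above. A rational rotation keeps all arithmetic exact.

```python
from fractions import Fraction

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(X):
    return [list(col) for col in zip(*X)]

def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def essential_from(R, t):
    """E = [t]_x R for a rotation R and a translation t."""
    t_cross = [[0, -t[2], t[1]], [t[2], 0, -t[0]], [-t[1], t[0], 0]]
    return matmul(t_cross, R)

def cubic_residual(E):
    """Entries of E E^T E - (1/2) tr(E E^T) E."""
    EEt = matmul(E, transpose(E))
    trace = sum(EEt[i][i] for i in range(3))
    EEtE = matmul(EEt, E)
    return [[EEtE[i][j] - Fraction(1, 2) * trace * E[i][j] for j in range(3)]
            for i in range(3)]

# Rational rotation about the z-axis from the Pythagorean triple (3, 4, 5).
c, s = Fraction(3, 5), Fraction(4, 5)
R = [[c, -s, 0], [s, c, 0], [0, 0, 1]]
t = [1, 2, 3]
E = essential_from(R, t)

print(det3(E))              # 0
print(cubic_residual(E))    # the zero matrix
```

The check works because $E E^\top = |t|^2 I - t t^\top$ and $t^\top [t]_\times = 0$, so that $E E^\top E = |t|^2 E$ while $\operatorname{tr}(E E^\top) = 2 |t|^2$.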


The reconstruction of calibrated cameras is now analogous to Example 13.13. Any pair
of image points $(y_1, y_2) \in \mathbb{P}^2 \times \mathbb{P}^2$ imposes a linear condition $y_2^\top E y_1 = 0$ on $\mathcal{E}$. Hence, we
intersect the five-dimensional variety $\mathcal{E}$ with five hyperplanes. That intersection consists
of ten complex matrices $E$. Each $E$ comes from two pairs of calibrated cameras modulo $G$.
We conclude that observing five points with two calibrated cameras is a minimal problem
of algebraic degree $20 = 2 \cdot 10$, confirming Proposition 13.11.

To reiterate: what we described is the standard approach in state-of-the-art reconstruction
algorithms. Given two images, one first reconstructs an essential matrix using RANSAC
by intersecting the variety $\mathcal{E}$ with five hyperplanes, then one recovers the pair of calibrated
cameras, and finally, one reconstructs the 3D points using triangulation. It would be
interesting to study this process through the lens of metric algebraic geometry. In particular,
following Chapter 9, what is the condition number of intersecting $\mathcal{E}$ with five hyperplanes
$y_2^\top E y_1 = 0$ given by five image point pairs $(y_1, y_2)$? One may hope to utilize those condition
numbers in homotopy continuation solvers.

More generally, we can ask the same question for intersecting the variety $\mathcal{G}_c := \operatorname{im}(\gamma_c)$
of Grassmann tensors with $\dim \mathcal{G}_c$ hyperplanes given by $(L_1, \ldots, L_m)$ as in (13.10). The
condition number of intersecting a fixed projective variety with varying linear subspaces of
complementary dimension was studied in [34]. However, that theory does not immediately
apply to our problem since $\mathcal{E}$ is only intersected by special linear spaces, namely those that
are intersections of five hyperplanes of the form $\{E : y_2^\top E y_1 = 0\}$.


Chapter 14
Volumes of Semialgebraic Sets


In this chapter, we study the problem of computing the volume of a semialgebraic subset $S$
of $\mathbb{R}^n$. Being semialgebraic means that $S$ is described by a finite Boolean combination of
polynomial inequalities. Most of our discussion is limited to semialgebraic sets of the form

$$S = \{ x \in \mathbb{R}^n \mid f(x) \geq 0 \} \quad \text{for some } f \in \mathbb{R}[x_1, \ldots, x_n]. \tag{14.1}$$


This is the special case of only one polynomial inequality. The volume of S is the numerical 
value of an integral that is described by polynomials. Students encounter such integrals in 
Multivariable Calculus, and we shall review this elementary perspective in Section 14.1. 
The rest of the chapter is much less elementary. Algebraic geometers use the term period 
integrals for real numbers such as vol(S). Our objective is the highly accurate numerical 
evaluation of period integrals. This is relevant for many fields, including physics (Feynman 
integrals) and statistics (Bayesian integrals). We present two algebraic methods, developed 
recently by Lairez [116], Lasserre [86], and their collaborators. Section 14.2 uses linear 
partial differential equations (D-modules) to represent and evaluate integrals. Section 14.3 
explains the evaluation of integrals by means of semidefinite programming (SDP). 


14.1 Calculus and Beyond 


Suppose we are given a basic semialgebraic set in $\mathbb{R}^n$. This means that our set is described
by a conjunction of polynomial inequalities; i.e., it has a representation of the form

$$S = \{ x \in \mathbb{R}^n \mid f_1(x) \geq 0 \text{ and } f_2(x) \geq 0 \text{ and } \cdots \text{ and } f_k(x) \geq 0 \}. \tag{14.2}$$

Here $f_1, f_2, \ldots, f_k$ are polynomials in $n$ unknowns with real coefficients. Our task is to
compute the volume of $S$, as reliably and accurately as possible, from the input $f_1, f_2, \ldots, f_k$.
The simplest scenario arises when $k = 1$, so our semialgebraic set $S$ is the domain of
nonnegativity of one polynomial $f(x) = f(x_1, \ldots, x_n)$ with real coefficients, as in (14.1).
Writing $dx$ for the Lebesgue measure on $\mathbb{R}^n$, we wish to evaluate the integral

$$\mathrm{vol}(S) = \int_S dx. \tag{14.3}$$

More generally, we can consider integrals $\int_S g(x)\,dx$, where $g(x)$ is a rational function, or
even an algebraic function. The value of such an integral (if it converges) is a real number,
which is called a period [115]. Our integrals are special cases of period integrals.


We begin with an instance where the volume of S can be found explicitly using calculus. 


Fig. 14.1: The yellow convex body is the elliptope. It is bounded by Cayley’s cubic surface. 


Example 14.1 (Elliptope) We consider the elliptope. This is the semialgebraic set
$$S = \{(x,y,z) \in \mathbb{R}^3 \mid M(x,y,z) \text{ is positive semi-definite}\},$$
where $M(x,y,z)$ is the following $3 \times 3$-matrix:


$$M(x,y,z) = \begin{pmatrix} 1 & x & y \\ x & 1 & z \\ y & z & 1 \end{pmatrix}.$$


An equivalent formulation is that S consists of all points (x, y, z) in the cube $[-1,1]^3$ such
that the determinant $\det(M) = 2xyz - x^2 - y^2 - z^2 + 1$ is nonnegative. It is the region
bounded by the yellow surface in Figure 14.1. This picture serves as the logo of the Nonlinear 
Algebra group at the Max-Planck Institute for Mathematics in the Sciences in Leipzig. The 
semialgebraic set S happens to be convex. It illustrates several applications of algebraic 
geometry. In statistics, the convex set S comprises all correlation matrices. In optimization, 
it is the feasible region of a semidefinite programming problem; see [133, Chapter 12]. 
Semidefinite programming is our workhorse for approximating volumes in Section 14.3. 
We now compute the volume of the elliptope. We begin by parameterizing its boundary
surface (the yellow surface in Figure 14.1). Solving the equation det(M) = 0 for z with the
quadratic formula, we obtain


$$z = xy \pm \sqrt{x^2y^2 - x^2 - y^2 + 1} = xy \pm \sqrt{(1-x^2)(1-y^2)} \quad \text{for } (x,y) \in [-1,1]^2.$$


The plus sign gives the upper yellow surface and the minus sign gives the lower yellow
surface. The volume of the elliptope S is obtained by integrating the difference between the
upper function and the lower function over the square. Hence, the desired volume equals


$$\mathrm{vol}(S) = \int_{-1}^{1}\int_{-1}^{1} 2\sqrt{(1-x^2)(1-y^2)}\,dx\,dy = 2\left(\int_{-1}^{1}\sqrt{1-t^2}\,dt\right)^{2}.$$


The univariate integral on the right gives the area below the semicircle with radius 1. We
know from trigonometry that this area equals $\pi/2$, where $\pi = 3.14159265\ldots$. We conclude


$$\mathrm{vol}(S) = \frac{\pi^2}{2} = 4.934802202\ldots$$


Thus, our elliptope covers about $(\pi^2/2)/8 \approx 61.7\,\%$ of the volume of the cube $[-1,1]^3$. ⋄
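
As a numerical sanity check on Example 14.1, the following Python sketch evaluates the product formula $2\,(\int_{-1}^{1}\sqrt{1-t^2}\,dt)^2$ with a plain midpoint rule. The function names and the number of nodes are our own illustrative choices and are not taken from the text.

```python
import math


def semicircle_area(num_nodes: int = 200_000) -> float:
    """Midpoint-rule approximation of the integral of sqrt(1 - t^2) over [-1, 1]."""
    h = 2.0 / num_nodes
    total = 0.0
    for i in range(num_nodes):
        t = -1.0 + (i + 0.5) * h  # midpoint of the i-th subinterval
        total += math.sqrt(1.0 - t * t)
    return total * h


def elliptope_volume(num_nodes: int = 200_000) -> float:
    """Volume of the elliptope via vol(S) = 2 * (semicircle area)^2."""
    area = semicircle_area(num_nodes)
    return 2.0 * area * area


if __name__ == "__main__":
    vol = elliptope_volume()
    print("numerical volume :", vol)
    print("pi^2 / 2         :", math.pi ** 2 / 2)
    print("fraction of cube :", vol / 8.0)
```

The midpoint rule converges slowly near the endpoints, where the integrand has a square-root singularity, so this only confirms the leading digits of $\pi^2/2$.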


The number $\pi^2/2$ we found is an example of a period. It is generally much more difficult
to accurately evaluate such integrals. This challenge has played an important role in the 
history of mathematics. Consider the problem of computing the arc length of an ellipse. This 
requires us to integrate the reciprocal square root of a cubic polynomial f(t). Such integrals 
are called elliptic integrals, and they represent periods of elliptic curves. Furthermore, in an 
1841 paper, Abel introduced abelian integrals, where g(t) is an algebraic function in one 
variable t. How to evaluate such an integral? This question leads us to Riemann surfaces 
and then to their Jacobians. And, voila, we arrived at the theory of abelian varieties. 


This chapter presents two current paradigms for accurately computing integrals 
like (14.3). The first method rests on the theory of D-modules, that is, on the algebraic 
study of linear differential equations with polynomial coefficients. Our volume is found as 
a special value of a parametric volume function that is encoded by means of its Picard- 
Fuchs differential equation. This method, which tends to appeal to algebraic geometers, 
was introduced by Lairez, Mezzarobba, and Safey El Din in [116]. 

The second approach is due to Lasserre, Henrion, Savorgnan, Tacchi, and Weisser 
[86, 169, 170]. At first glance, the approach in Section 14.3 might appeal more to readers 
from analysis and optimization. But, we hope that algebraic geometers will find it interesting
as well, as it entails plenty of deep algebraic structure. The idea is to consider all
moments $m_\alpha = \int_S x^\alpha\,dx$ of our semialgebraic set S, where again dx is the Lebesgue measure,
and to use relations among the $m_\alpha$ to infer an accurate approximation of $m_0 = \mathrm{vol}(S)$.
That numerical inference rests on semidefinite programming [133, Chapter 12].


14.2 D-Modules


In calculus, we learn about definite integrals in order to determine the area under a graph. 
Likewise, in multivariable calculus, we examine the volume enclosed by a surface. Here, we 
are interested in areas and volumes of semialgebraic sets. When these sets depend on one 
or more parameters, their volumes are holonomic functions of the parameters. We explain 
what this means and how it can be used for accurate evaluation of volume functions. We 
present the method of [116], following the exposition given in [159]. 


Definition 14.2 The Weyl algebra (cf. [155, 159]) is the quotient of the free C-algebra
generated by 2n variables $x_1,\dots,x_n,\partial_1,\dots,\partial_n$ by the ideal generated by
$\partial_i x_i - x_i \partial_i - 1$ for $i = 1,\dots,n$, together with the commutators of all other pairs of
generators. We denote the Weyl algebra by


$$D = \mathbb{C}\langle x_1,\dots,x_n,\partial_1,\dots,\partial_n\rangle.$$


In the definition of the Weyl algebra, the $x_i$ play the role of the usual variables in calculus.
The $\partial_i$, on the other hand, play the role of partial derivatives. Indeed, if $g\colon \mathbb{R}^n \to \mathbb{R}$ is a
differentiable function, then the product rule yields $\partial_i(x_i \cdot g) = g + x_i(\partial_i g)$. This is precisely
the algebraic relation that we quotient out.
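
To make the defining relation concrete, here is a small Python sketch for n = 1 that lets the two generators act on polynomials and checks that $\partial x - x \partial$ acts as the identity. The list-of-coefficients representation and the helper names are hypothetical and serve only as illustration.

```python
from fractions import Fraction
from itertools import zip_longest

# A polynomial c0 + c1*x + c2*x^2 + ... is stored as the list [c0, c1, c2, ...].


def mul_x(p):
    """Action of the generator x: multiply the polynomial by x."""
    return [Fraction(0)] + list(p)


def diff(p):
    """Action of the generator d: differentiate the polynomial."""
    return [k * c for k, c in enumerate(p)][1:]


def sub(p, q):
    """Coefficientwise difference p - q, with trailing zeros removed."""
    r = [a - b for a, b in zip_longest(p, q, fillvalue=Fraction(0))]
    while r and r[-1] == 0:
        r.pop()
    return r


def commutator(p):
    """Apply d*x - x*d to p; by the Weyl relation the result is p itself."""
    return sub(diff(mul_x(p)), mul_x(diff(p)))


g = [Fraction(3), Fraction(0), Fraction(-1, 2), Fraction(7)]  # 3 - x^2/2 + 7x^3
print(commutator(g) == g)
```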

Suppose that M is a D-module. We denote the action of D on M by


$$P \bullet f \quad \text{for } P \in D,\ f \in M.$$


Definition 14.3 Let M be a D-module. The annihilator or annihilating D-ideal of an
element $f \in M$ is the D-ideal


$$\mathrm{Ann}_D(f) = \{P \in D \mid P \bullet f = 0\}.$$


In applications, the D-module M is usually a space of infinitely differentiable functions on
a subset of $\mathbb{R}^n$ or $\mathbb{C}^n$. Such D-modules are torsion-free. In the following, we mainly focus
on annihilators that are holonomic D-ideals.


Definition 14.4 Let M be a D-module and $f \in M$. For instance, f could be among the
functions mentioned above. We say that f is holonomic if, for each $k \in \{1,\dots,n\}$, there
is a differential operator


$$P_k = \sum_{i=0}^{m_k} p_i(x)\,\partial_k^{\,i} \;\in\; \bigl(D \cap \mathbb{C}[x_1,\dots,x_n]\langle\partial_k\rangle\bigr)\setminus\{0\} \tag{14.4}$$

that annihilates f; i.e., $P_k \bullet f = 0$. Here, the $p_i(x)$ are polynomials in $x = (x_1,\dots,x_n)$.


If this holds, then we say that $\mathrm{Ann}_D(f)$ is a holonomic D-ideal.


The term “holonomic function” is due to Zeilberger [179]. Many interesting functions are 
holonomic. For instance, holonomic functions in one variable are solutions to ordinary 
linear differential equations with rational function coefficients. This includes algebraic 
functions, some elementary trigonometric functions, hypergeometric functions, Bessel 
functions, period integrals, and many more.


Proposition 14.5 Every rational function is holonomic.


Proof Let r be a rational function in $x = (x_1,\dots,x_n)$. It is annihilated by the operator
$r(x)\,\partial_i - \partial r/\partial x_i$ for $i = 1,2,\dots,n$. By clearing denominators in this operator, we obtain
a non-zero differential operator $P_i \in \mathbb{C}[x]\langle\partial_i\rangle$ of order $m_i = 1$ that annihilates r. □


To find holonomic annihilating D-ideals, it is helpful to use computer algebra. 


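Example 14.6 Let $r \in \mathbb{Q}(x_1,\dots,x_n)$ be a rational function. We can compute its annihilator
in Macaulay2 [73] with the built-in command RatAnn. For instance, if n = 2 and $r = x_1/x_2$,
then we can use the following code

needsPackage "Dmodules"
-- Weyl algebra in x1, x2 with d1, d2 acting as the partial derivatives
D = QQ[x1,x2,d1,d2, WeylAlgebra => {x1=>d1,x2=>d2}];
-- numerator and denominator of the rational function r = x1/x2
rnum = x1; rden = x2;
I = RatAnn(rnum, rden)


This code fragment shows that $r = x_1/x_2$ has $\mathrm{Ann}_D(r) = D\{\partial_1^2,\ x_1\partial_1 - 1,\ x_2\partial_1\partial_2 + \partial_1\}$. ⋄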
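
The three generators reported in Example 14.6 can be checked by hand. The Python sketch below does this with exact rational arithmetic, using the partial derivatives of $r = x_1/x_2$ written out explicitly. It is only a plausibility check at sample points, not a substitute for the Macaulay2 computation, and all function names are our own.

```python
from fractions import Fraction


def r(x1, x2):
    return x1 / x2


def d1_r(x1, x2):
    """Partial derivative of x1/x2 with respect to x1."""
    return 1 / x2


def d1d1_r(x1, x2):
    """Second partial derivative with respect to x1, identically zero."""
    return Fraction(0)


def d1d2_r(x1, x2):
    """Mixed partial derivative of x1/x2."""
    return -1 / (x2 * x2)


def generator_values(x1, x2):
    """Values of the three generators applied to r at the point (x1, x2)."""
    return (
        d1d1_r(x1, x2),                          # d1^2 applied to r
        x1 * d1_r(x1, x2) - r(x1, x2),           # (x1*d1 - 1) applied to r
        x2 * d1d2_r(x1, x2) + d1_r(x1, x2),      # (x2*d1*d2 + d1) applied to r
    )


sample_points = [(Fraction(2), Fraction(3)), (Fraction(-5, 7), Fraction(11, 4))]
print(all(v == 0 for p in sample_points for v in generator_values(*p)))
```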


Now, let M be a D-module, $f \in M$, and $\mathrm{Ann}_D(f)$ a holonomic D-ideal. As in (14.4),
let $m_1,\dots,m_n$ denote the orders of the differential operators $P_1,\dots,P_n$. Thus, $P_k$ is an
operator in $\partial_k$ of order $m_k$ whose coefficients are polynomials in x. We fix initial conditions
for f by specifying the following $m_1 m_2 \cdots m_n$ numerical values for a general point $x_0 \in \mathbb{C}^n$:


$$\bigl(\partial_1^{\,i_1} \cdots \partial_n^{\,i_n} \bullet f\bigr)\big|_{x = x_0} \quad \text{where } 0 \leq i_k < m_k \text{ for } k = 1,\dots,n. \tag{14.5}$$


The operators $P_k$ together with the initial conditions in (14.5) specify the function f.

Suppose f(x) is an algebraic function. This means that there is a polynomial
$F \in \mathbb{C}[y,x_1,\dots,x_n]$ such that f satisfies the equation F(f,x) = 0. Every algebraic function
f(x) in n variables is holonomic. Koutschan [112] developed practical algorithms for
manipulating holonomic functions. These are implemented in his Mathematica [102]
package HolonomicFunctions [112]. Using the polynomial F as its input, this package
can compute a holonomic representation of f. The output is a linear differential operator
of lowest order annihilating f. We now show this with an explicit example.


Example 14.7 Let n = 1 and consider the algebraic function y = f(x) that is defined by


$$q(x,y) = x^4 + y^4 + \frac{1}{100}\,xy - 1. \tag{14.6}$$


Its annihilator in D can be computed as follows: 


<< RISC`HolonomicFunctions` 
q = y^4 + x^4 + x*y/100 - 1;
ann = Annihilator[Root[q, y, 1], Der[x]] 


This Mathematica code determines an operator P of lowest order in $\mathrm{Ann}_D(f)$. We find


$$P = g_0(x) + g_1(x)\,\partial + g_2(x)\,\partial^2 + g_3(x)\,\partial^3,$$


with coefficients

$$\begin{aligned}
g_3(x) &= (2x^4+1)^2\,(25600000000x^{12} - 76800000000x^{8} + 76799999973x^{4} - 25600000000),\\
g_2(x) &= 6x^3(2x^4+1)\,(51200000000x^{12} + 76800000000x^{8} - 307199999946x^{4} + 179199999973),\\
g_1(x) &= 3x^2\,(102400000000x^{16} + 204800000000x^{12} + 2892799999572x^{8} - 3507199999444x^{4} + 307199999953),\\
g_0(x) &= -3x\,(102400000000x^{16} + 204800000000x^{12} + 1459199999796x^{8} - 1049599999828x^{4} + 51199999993).
\end{aligned}$$


This operator is an encoding of the algebraic function y = f(x) as a holonomic function. ⋄


As discussed above, we specify a holonomic function f in n variables by a holonomic 
system of linear PDEs together with a list of initial conditions. Initial conditions such as 
(14.5) are designed to determine the function uniquely inside the linear space Sol(I), where
$I \subseteq \mathrm{Ann}_D(f)$. For instance, in Example 14.7, we need three initial conditions to specify the
function f(x) uniquely inside the 3-dimensional solution space to our operator P. We could 
fix the values at three distinct points, or we could fix the value and the first two derivatives 
at one special point. We generalize the canonical representation (14.5) as follows. 


Definition 14.8 A holonomic representation of a function f is a holonomic D-ideal I
contained in the annihilator $\mathrm{Ann}_D(f)$ together with a list of linear conditions that specify f
uniquely inside the finite-dimensional solution space of holomorphic solutions. Think of 
the initial conditions for solutions to differential equations. 


The next example shows the role of holonomic functions for metric algebraic geometry. 


Example 14.9 (The area of a TV screen) Consider the quartic polynomial q(x, y) from
(14.6). We are interested in the semialgebraic set $S = \{(x,y) \in \mathbb{R}^2 \mid q(x,y) \leq 0\}$. This
convex set is a slight modification of a set known in the optimization literature as “the TV
screen”. Our aim is to compute the area of the semialgebraic convex set S as accurately
as possible. One can get a rough idea of the area of S by sampling. This is illustrated in
Figure 14.2. From the polynomial q(x, y) we read off that S is contained in the square
defined by $-1.2 \leq x, y \leq 1.2$. We sampled 10000 points uniformly from that square, and
for each sample, we checked the sign of q. This is a simple instance of the Metropolis-Hastings
sampling algorithm that we discuss in Chapter 15. Points inside S are drawn in
blue and points outside S are drawn in pink. By multiplying the area $(2.4)^2 = 5.76$ of the
square with the fraction of the number of blue points among the samples, we learn that the
area of the TV screen is approximately 3.7077.
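
The sampling experiment can be imitated in a few lines of Python. The sketch below draws uniform points from the square $[-1.2, 1.2]^2$ and counts those with $q \leq 0$. The seed, the sample size, and the function names are our own assumptions, so the estimate will differ slightly from the value 3.7077 quoted above.

```python
import random


def q(x: float, y: float) -> float:
    """The quartic (14.6) that defines the TV screen."""
    return x ** 4 + y ** 4 + x * y / 100.0 - 1.0


def estimate_area(num_samples: int = 100_000, half_width: float = 1.2, seed: int = 0) -> float:
    """Estimate the area of {q <= 0} by uniform sampling in a bounding square."""
    rng = random.Random(seed)  # fixed seed so that the run is reproducible
    hits = 0
    for _ in range(num_samples):
        x = rng.uniform(-half_width, half_width)
        y = rng.uniform(-half_width, half_width)
        if q(x, y) <= 0.0:
            hits += 1
    square_area = (2.0 * half_width) ** 2
    return square_area * hits / num_samples


if __name__ == "__main__":
    print(estimate_area())
```

The statistical error of such an estimate decays only like the inverse square root of the number of samples, which is why the text turns to D-modules for high accuracy.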

We now compute the area more accurately using D-modules. Let $\mathrm{pr}\colon S \to \mathbb{R}$ be the
projection on the x-coordinate, and consider the length of the fibers of this projection:


$$v(x) = \ell\bigl(\mathrm{pr}^{-1}(x) \cap S\bigr).$$


This length is a holonomic function in the parameter x. Namely, it satisfies the third-order
differential operator P we displayed in Example 14.7. The real roots of the resultant

$$\mathrm{Res}_y(q, \partial q/\partial y) = 25600000000x^{12} - 76800000000x^{8} + 76799999973x^{4} - 25600000000 \tag{14.7}$$

are the two branch points $x_0 < x_1$ of the map pr. These values of $x_0$ and $x_1$ can be written
in radicals over Q. However, here we take an accurate floating point representation:


$$x_1 = -x_0 = 1.00025446585025884547854576664356675008019627615897635\ldots$$


Fig. 14.2: The TV screen is the convex region consisting of the blue points.
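
The resultant and the branch point can be cross-checked independently. The Python sketch below relies on the classical discriminant formula $256\,c^3 - 27\,b^4$ for a quartic $y^4 + by + c$ (here $b = x/100$ and $c = x^4 - 1$), scaled by $10^8$ to clear denominators, and then locates the positive real root by bisection in high-precision decimal arithmetic. Using this closed formula instead of a general resultant routine is our own shortcut, and the bracketing interval [1, 1.01] is an assumption.

```python
from decimal import Decimal, getcontext

getcontext().prec = 60  # working precision in decimal digits

SCALE = 10 ** 8
# 10^8 * (256*(u - 1)^3 - 27*u/10^8) with u = x^4, listed from u^3 down to u^0.
COEFFS = [256 * SCALE, -3 * 256 * SCALE, 3 * 256 * SCALE - 27, -256 * SCALE]


def scaled_discriminant(x: Decimal) -> Decimal:
    """Evaluate the polynomial in (14.7) at x by Horner's rule in u = x^4."""
    u = x ** 4
    value = Decimal(0)
    for c in COEFFS:
        value = value * u + c
    return value


def branch_point(iterations: int = 200) -> Decimal:
    """Bisection for the positive real root, bracketed in [1, 1.01]."""
    lo, hi = Decimal(1), Decimal("1.01")
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if scaled_discriminant(mid) > 0:
            hi = mid
        else:
            lo = mid
    return lo


if __name__ == "__main__":
    print(COEFFS)
    print(branch_point())
```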


The desired area equals
$$\mathrm{vol}(S) = w(x_1),$$

where $w(x) = \int_{x_0}^{x} v(t)\,dt$ is holonomic (see Proposition 14.15 below). One operator that
annihilates w is $P\partial$, where $P \in \mathrm{Ann}_D(v)$ is the third-order operator above. To get a
holonomic representation of w, we also need some initial conditions. Clearly, $w(x_0) = 0$.
Further initial conditions on w’ are derived by evaluating v at other points. By plugging
values for x into (14.6) and solving for y, we find $w'(0) = 2$ and $w'(\pm 1) = 1/\sqrt[3]{100}$. Thus,
we now have four linear constraints on our function w, albeit at different points.
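
The fiber lengths that serve as initial conditions are easy to reproduce numerically. The following Python sketch computes $v(x)$ by bisection, using that $q(x,\cdot)$ is convex in y, and then integrates $v$ over $[x_0, x_1]$ after the substitution $x = x_1 \sin\theta$, which removes the square-root behavior at the branch points. This is a plain floating-point quadrature of our own and not the D-module method of the text. The constant `X1` is the branch point quoted above, rounded to double precision.

```python
import math

X1 = 1.0002544658502588  # branch point x_1, rounded to double precision


def q(x: float, y: float) -> float:
    return x ** 4 + y ** 4 + x * y / 100.0 - 1.0


def fiber_length(x: float) -> float:
    """Length v(x) of {y : q(x, y) <= 0}; q(x, .) is convex, so there are at most two roots."""
    # The minimizer in y solves 4*y^3 + x/100 = 0.
    y_min = -math.copysign(abs(x / 400.0) ** (1.0 / 3.0), x)
    if q(x, y_min) >= 0.0:
        return 0.0

    def root(inside: float, outside: float) -> float:
        # Bisection with q(x, inside) < 0 < q(x, outside).
        for _ in range(80):
            mid = 0.5 * (inside + outside)
            if q(x, mid) < 0.0:
                inside = mid
            else:
                outside = mid
        return 0.5 * (inside + outside)

    return root(y_min, 2.0) - root(y_min, -2.0)


def area(num_nodes: int = 2000) -> float:
    """Midpoint rule for the integral of v(x) dx with x = X1 * sin(theta)."""
    h = math.pi / num_nodes
    total = 0.0
    for i in range(num_nodes):
        theta = -0.5 * math.pi + (i + 0.5) * h
        total += fiber_length(X1 * math.sin(theta)) * X1 * math.cos(theta)
    return total * h


if __name__ == "__main__":
    print(fiber_length(0.0), fiber_length(1.0), 100.0 ** (-1.0 / 3.0))
    print(area())
```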

Our goal is to determine a unique function $w \in \mathrm{Sol}(P\partial)$ by incorporating these four
initial conditions, and then to evaluate that holonomic function w at the branch point $x_1$. To
this end, we proceed as follows. We consider any point $x_{\mathrm{ord}} \in \mathbb{R}$ at which the differential
operator $P\partial$ is not singular, i.e., the coefficient of the highest order differentiation does
not vanish at $x_{\mathrm{ord}}$. Using the command local_basis_expansion that is built into the
SAGE [171] package ore_algebra, we compute a basis of local series solutions to $P\partial$ at
the point $x_{\mathrm{ord}}$. Since that point is nonsingular, that basis has the following form:


$$\begin{aligned}
s_{x_{\mathrm{ord}},0}(x) &= 1 + O\bigl((x - x_{\mathrm{ord}})^4\bigr), & s_{x_{\mathrm{ord}},1}(x) &= (x - x_{\mathrm{ord}}) + O\bigl((x - x_{\mathrm{ord}})^4\bigr),\\
s_{x_{\mathrm{ord}},2}(x) &= (x - x_{\mathrm{ord}})^2 + O\bigl((x - x_{\mathrm{ord}})^4\bigr), & s_{x_{\mathrm{ord}},3}(x) &= (x - x_{\mathrm{ord}})^3 + O\bigl((x - x_{\mathrm{ord}})^4\bigr).
\end{aligned}$$


Locally at $x_{\mathrm{ord}}$, our solution is given by a unique choice of four coefficients $c_{x_{\mathrm{ord}},i}$, namely


$$w(x) = c_{x_{\mathrm{ord}},0}\, s_{x_{\mathrm{ord}},0}(x) + c_{x_{\mathrm{ord}},1}\, s_{x_{\mathrm{ord}},1}(x) + c_{x_{\mathrm{ord}},2}\, s_{x_{\mathrm{ord}},2}(x) + c_{x_{\mathrm{ord}},3}\, s_{x_{\mathrm{ord}},3}(x).$$

At a regular singular point $x_{\mathrm{rs}}$, complex powers of x and log(x) can appear in the local
basis at $x_{\mathrm{rs}}$. Any initial condition at a point determines a linear constraint on its coefficients.
For instance, $w'(0) = 2$ implies $c_{0,1} = 2$, and similarly for our initial conditions at −1, 1,
and $x_0$. One challenge is that the initial conditions pertain to different points. To address
this, we calculate transition matrices that relate the basis above of series solutions at one
point to the basis at another point. These are invertible 4 × 4 matrices.

With the method described above, we find the basis of series solutions at $x_1$, along with a
system of four linear constraints on the four coefficients $c_{x_1,i}$. These constraints are derived
from the initial conditions at 0, ±1, and $x_0$, using the 4 × 4 transition matrices. By solving
these linear equations, we compute the desired function value up to any desired precision:


$$w(x_1) = 3.70815994474216228834822556114586537124306581991393470943\ldots$$


In conclusion, this real number is the area of the TV screen S in Figure 14.2 that is defined
by the polynomial q(x, y) in (14.6). ⋄


Before we discuss the approach from Example 14.9 in a general setting, let us first study 
the structure of the space of holonomic functions. They turn out to have remarkable closure 
properties. In the following, let f and g be functions in $x = (x_1,\dots,x_n)$. For the proof of
the following result, see [159, Proposition 2.3]. 


Proposition 14.10 Let f (x) be holonomic and g(x) algebraic. Then f(g(x)) is holonomic. 


While algebraic transformations of holonomic functions are again holonomic by Proposi- 
tion 14.10, this is not true for rational functions of holonomic functions. To see this, we 
consider holonomic functions in one variable f(x). A necessary condition for a meromor- 
phic function f(x) to be holonomic is that it has only finitely many poles in C. 

For a concrete example, we start with the holonomic function sin(x). This is annihilated
by the operator $\partial^2 + 1$. Its reciprocal $f(x) = \frac{1}{\sin(x)}$ has infinitely many poles, so is not
holonomic. Hence, the class of holonomic functions is not closed under division. It is also
not closed under composition of functions since both $\frac{1}{x}$ and sin(x) are holonomic.


Proposition 14.11 If f, g are holonomic functions, then both f + g and f · g are holonomic.


Proof For each index $i \in \{1,2,\dots,n\}$, there exist non-zero operators $P_i$ and $Q_i$ in $\mathbb{C}[x]\langle\partial_i\rangle$
which satisfy $P_i \bullet f = Q_i \bullet g = 0$. Set $n_i = \mathrm{order}(P_i)$ and $m_i = \mathrm{order}(Q_i)$. Then, the
$\mathbb{C}(x)$-linear span of the set $\{\partial_i^k \bullet f\}_{k=0,\dots,n_i}$ has dimension $\leq n_i$, and the
span of the set $\{\partial_i^k \bullet g\}_{k=0,\dots,m_i}$ has dimension $\leq m_i$.

Now consider $\partial_i^k \bullet (f+g)$, which we can write as $\partial_i^k \bullet f + \partial_i^k \bullet g$. Therefore, the
$\mathbb{C}(x)$-linear span of $\{\partial_i^k \bullet (f+g)\}_{k=0,\dots,n_i+m_i}$ has dimension $\leq n_i + m_i$. Hence, there exists
a non-zero operator $S_i \in \mathbb{C}[x]\langle\partial_i\rangle$ such that $S_i \bullet (f+g) = 0$. Since this holds for all
indices i, we conclude that the sum f + g is holonomic.

A similar proof works for the product f · g. For $i \in \{1,2,\dots,n\}$, we consider the
set $\{\partial_i^k \bullet (f \cdot g)\}_{k=0,\dots,n_i m_i}$. By applying the product rule, we find that the $n_i m_i + 1$
generators are linearly dependent over the rational function field $\mathbb{C}(x)$. Hence, there is a
non-zero operator $T_i \in \mathbb{C}[x]\langle\partial_i\rangle$ such that $T_i \bullet (f \cdot g) = 0$. Hence, f · g is holonomic. □

The proof above gives a linear algebra method for computing an annihilating D-ideal
for the sum f + g (resp. the product f · g), starting from such D-ideals for f and g. The
following example illustrates Proposition 14.11.


Example 14.12 This example is similar to one in [179, Section 4.1]. Consider the univariate
functions f(x) = exp(x) and $g(x) = \exp(-x^2)$. Their canonical holonomic representations
are $I_f = \langle \partial - 1 \rangle$ with f(0) = 1 and $I_g = \langle \partial + 2x \rangle$ with g(0) = 1. We are interested in the
function h = f + g. Its first two derivatives are obtained as follows:


$$\begin{pmatrix} h \\ \partial \bullet h \\ \partial^2 \bullet h \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ 1 & -2x \\ 1 & 4x^2 - 2 \end{pmatrix} \begin{pmatrix} f \\ g \end{pmatrix}.$$


By computing the left kernel of this 3 × 2-matrix, we find that h = f + g is annihilated by
$$I_h = \bigl\langle (2x+1)\,\partial^2 + (4x^2 - 3)\,\partial - 4x^2 - 2x + 2 \bigr\rangle, \quad \text{with } h(0) = 2,\ h'(0) = 1.$$


For the product j = f · g we have $j' = f'g + fg' = f \cdot g + f \cdot (-2xg) = (1 - 2x)\,j$, so the
canonical holonomic representation of j is the D-ideal $I_j = \langle \partial + 2x - 1 \rangle$ with j(0) = 1. ⋄
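
To see how the operator generating $I_h$ together with the two initial conditions pins down $h$, the following Python sketch expands a power series solution at 0. Comparing coefficients of $x^k$ in $(2x+1)h'' + (4x^2-3)h' + (-4x^2-2x+2)h = 0$ gives a linear recurrence for the Taylor coefficients $h_k$; that recurrence was derived by hand for this illustration and is not taken from the text. Exact rational arithmetic is used because the recurrence can amplify rounding errors.

```python
from fractions import Fraction
import math


def series_coefficients(num_terms: int = 40):
    """Taylor coefficients at 0 of the solution of I_h with h(0) = 2, h'(0) = 1."""
    h = [Fraction(2), Fraction(1)]

    def coeff(k: int) -> Fraction:
        return h[k] if k >= 0 else Fraction(0)

    for k in range(num_terms - 2):
        # Coefficient of x^k in (2x+1)h'' + (4x^2-3)h' + (-4x^2-2x+2)h, without the h_{k+2} term.
        rest = (
            (2 * k * (k + 1) - 3 * (k + 1)) * coeff(k + 1)
            + 2 * coeff(k)
            + (4 * (k - 1) - 2) * coeff(k - 1)
            - 4 * coeff(k - 2)
        )
        h.append(-rest / ((k + 2) * (k + 1)))
    return h


def evaluate(coeffs, x: Fraction) -> Fraction:
    """Evaluate the truncated power series at x by Horner's rule."""
    value = Fraction(0)
    for c in reversed(coeffs):
        value = value * x + c
    return value


if __name__ == "__main__":
    approx = float(evaluate(series_coefficients(), Fraction(3, 10)))
    exact = math.exp(0.3) + math.exp(-0.09)
    print(approx, exact)
```

The truncated series reproduces $\exp(x) + \exp(-x^2)$ near 0, in line with the idea of a holonomic representation in Definition 14.8.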


Proposition 14.13 Let f be holonomic in n variables and m < n. The restriction of f to
the coordinate subspace $\{x_{m+1} = \cdots = x_n = 0\}$ is a holonomic function in $x_1,\dots,x_m$.


Proof For $i \in \{m+1,\dots,n\}$, we consider the right ideal $x_i D$ in the Weyl algebra D. This
ideal is a left module over


$$D_m = \mathbb{C}\langle x_1,\dots,x_m,\partial_1,\dots,\partial_m\rangle.$$


The sum of these ideals with $\mathrm{Ann}_D(f)$ is hence a left $D_m$-module. Its intersection with
$D_m$ is called the restriction ideal:


$$\bigl(\mathrm{Ann}_D(f) + x_{m+1}D + \cdots + x_n D\bigr) \cap D_m. \tag{14.8}$$


By [155, Prop. 5.2.4], this $D_m$-ideal is holonomic and annihilates $f(x_1,\dots,x_m,0,\dots,0)$. □


Proposition 14.14 The partial derivatives of a holonomic function are holonomic.


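Proof Let f be holonomic and $P_i \in \mathbb{C}[x]\langle\partial_i\rangle \setminus \{0\}$ with $P_i \bullet f = 0$ for all i. We can write $P_i$
as $P_i = \tilde{P}_i\,\partial_i + a_i(x)$, where $\tilde{P}_i \in \mathbb{C}[x]\langle\partial_i\rangle$ and $a_i \in \mathbb{C}[x]$. If $a_i = 0$, then
$\tilde{P}_i \bullet \partial f/\partial x_i = 0$ and we are done. Assume
$a_i \neq 0$. Since both $a_i$ and f are holonomic, by Proposition 14.11, there is a non-zero linear
operator $Q_i \in \mathbb{C}[x]\langle\partial_i\rangle$ such that $Q_i \bullet (a_i \cdot f) = 0$. Then $Q_i \tilde{P}_i$ annihilates $\partial f/\partial x_i$. □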


A key fact about D-modules (see [155, Section 5.5]) is that integration is dual, in the 
sense of the Fourier transform, to restriction. Here is the dual to Proposition 14.13: 


Proposition 14.15 Let $f\colon \mathbb{R}^n \to \mathbb{C}$ be a holonomic function. Then the definite integral


$$F(x_1,\dots,x_{n-1}) = \int_a^b f(x_1,\dots,x_{n-1},x_n)\,dx_n$$

is a holonomic function in n − 1 variables, assuming the integral converges.

By dualizing (14.8), we obtain the following $D_m$-ideal, known as the integration ideal:
$$\bigl(\mathrm{Ann}_D(f) + \partial_{m+1}D + \cdots + \partial_n D\bigr) \cap D_m \quad \text{for } m < n.$$


The expression is Fourier dual to the restriction ideal (14.8). This exchanges $x_i$ and $\partial_i$. If
m = n − 1, then the integration ideal annihilates the holonomic function F above.
Equipped with our tools for holonomic functions, we now return to volumes of semialge- 
braic sets, using the method of Lairez, Mezzarobba, and Safey El Din [116]. They compute 
this volume by deriving a differential operator that encodes periods of an integral [115]: 


Definition 14.16 Let $R(t, x_1,\dots,x_n)$ be a rational function and consider the formal integral


$$\int R(t, x_1,\dots,x_n)\,dx_1 \cdots dx_n. \tag{14.9}$$


Fix an open subset $\Omega$ of either $\mathbb{R}$ or $\mathbb{C}$. An analytic function $\phi\colon \Omega \to \mathbb{C}$ is a period of the
integral (14.9) if, for any $s \in \Omega$, there exists a neighborhood $\Omega' \subset \Omega$ of s and an n-cycle
$\gamma \subset \mathbb{C}^n$ with the following property: For all $t \in \Omega'$, the cycle $\gamma$ is disjoint from the poles
of $R_t := R(t,\,\cdot\,)$ and


$$\phi(t) = \int_\gamma R(t, x_1,\dots,x_n)\,dx_1 \cdots dx_n. \tag{14.10}$$


In the following, we make some assumptions. Let $S = \{f \leq 0\} \subset \mathbb{R}^n$ be a compact
semialgebraic set, given by $f \in \mathbb{Q}[x_1,\dots,x_n]$. Furthermore, let $\mathrm{pr}\colon \mathbb{R}^n \to \mathbb{R}$ denote the
projection on the first coordinate. The set of branch points of pr restricted to the zero locus
of f is the following subset of the real line, which is assumed to be finite:


$$\Sigma_f = \Bigl\{ p \in \mathbb{R} \;\Big|\; \exists\, y = (y_2,\dots,y_n) \in \mathbb{R}^{n-1} : f(p,y) = 0 \text{ and } \frac{\partial f}{\partial x_i}(p,y) = 0,\ i = 2,\dots,n \Bigr\}.$$


The polynomial in the unknown p that defines $\Sigma_f$ is obtained by eliminating $x_2,\dots,x_n$. It
can be represented as a multivariate resultant, generalizing the Sylvester resultant in (14.7).

Fix an open interval I in $\mathbb{R}$ with $I \cap \Sigma_f = \emptyset$. For any $x_1 \in I$, the set $S_{x_1} = \mathrm{pr}^{-1}(x_1) \cap S$
is compact and semialgebraic in (n − 1)-space. We are interested in its volume. By [116,
Theorem 9], the function $v\colon I \to \mathbb{R}$, $x_1 \mapsto \mathrm{vol}_{n-1}(S_{x_1})$ is a period of the rational integral


$$\frac{1}{2\pi i} \oint \frac{x_2}{f(x_1,x_2,\dots,x_n)}\,\frac{\partial f(x_1,x_2,\dots,x_n)}{\partial x_2}\,dx_2 \cdots dx_n. \tag{14.11}$$


Let $e_1 < e_2 < \cdots < e_K$ be the branch points in $\Sigma_f$ and set $e_0 = -\infty$ and $e_{K+1} = \infty$. This
specifies the pairwise disjoint open intervals $I_k = (e_k, e_{k+1})$. They satisfy $\mathbb{R} \setminus \Sigma_f = \bigcup_{k=0}^{K} I_k$.

Fix the holonomic functions $w_k(t) = \int_{e_k}^{t} v(x_1)\,dx_1$. The volume of S then is obtained as


$$\mathrm{vol}(S) = \int_{e_1}^{e_K} v(x_1)\,dx_1 = \sum_{k=1}^{K-1} w_k(e_{k+1}).$$

How does one compute such a sum? As a period of the rational integral (14.11), the
volume v is a holonomic function on each interval $I_k$. A key step is to find an operator P
in $D_1 = \mathbb{C}\langle x_1, \partial_1 \rangle$ that annihilates $v|_{I_k}$ for all k. The product $P\partial_1$ annihilates the functions
$w_k(x_1)$ for all k. By imposing sufficiently many initial conditions, we can reconstruct the
functions $w_k$ from the operator $P\partial_1$. Initial conditions that come for free are $w_k(e_k) = 0$ for
all k. Finally, for evaluating the functions $w_k$ we use methods for numerical integration.

The differential operator P is known as the Picard-Fuchs equation of the period in
question. The following software packages can be used to compute Picard-Fuchs equations:


• HolonomicFunctions by C. Koutschan in Mathematica [102],
• ore_algebra by M. Kauers in SAGE [171],
• periods by P. Lairez in MAGMA [24],
• Ore_Algebra by F. Chyzak in Maple [127].


We now apply this to compute volumes. Starting from the polynomial f, we compute
the Picard-Fuchs operator $P \in D_1$ along with suitable initial conditions. For each interval
$I_k$, we perform the following steps, here described for the ore_algebra package in SAGE:


(i) Using the command local_basis_expansion, compute a local basis of series solutions
for the linear differential operator $P\partial_1$ at various points in $[e_k, e_{k+1}]$.
(ii) Using the command op.numerical_transition_matrix, numerically compute a
transition matrix for the series solution basis from one point to another one.
(iii) From the initial conditions, construct linear relations between the coefficients in the
local basis expansions. Using step (ii), transfer them to the branch point $e_{k+1}$.
(iv) Plug in to the local basis expansion at $e_{k+1}$ and thus evaluate the volume of $S \cap \mathrm{pr}^{-1}(I_k)$.


Using this, we compute the volume of a convex body in 3-space, shown in Figure 14.3. 


Fig. 14.3: The quartic bounds the convex region consisting of the gray points. 


Example 14.17 (Quartic surface) Fix the quartic polynomial


$$f(x,y,z) = x^4 + y^4 + z^4 + \frac{x^3 y}{20} - \frac{xyz}{20} - \frac{yz}{100} + \frac{z^2}{50} - 1 \tag{14.12}$$

and consider the set $S = \{(x,y,z) \in \mathbb{R}^3 \mid f(x,y,z) \leq 0\}$. Our aim is to compute vol(S).

As in Example 14.9, we get a rough idea by sampling. This is illustrated in Figure 14.3.
Our set S is compact, convex, and contained in the cube $-1.05 \leq x, y, z \leq 1.05$. We
sampled 10000 points uniformly from that cube. For each sample, we checked the sign of
f(x,y,z). By multiplying the volume $(2.1)^3 = 9.261$ of the cube by the fraction of gray
points among the sampled points, we found that $\mathrm{vol}(S) \approx 6.4771$. In order to gain a higher
precision, we now compute the volume of our semialgebraic set S by means of D-modules.
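
The rough estimate for the quartic body can be reproduced along the same lines as for the TV screen. The Python sketch below samples the cube $[-1.05, 1.05]^3$ uniformly and counts points with $f \leq 0$. The seed, the sample size, and the function names are our own assumptions, so the result agrees with the values in the text only up to sampling error.

```python
import random


def f(x: float, y: float, z: float) -> float:
    """The quartic polynomial (14.12)."""
    return (x ** 4 + y ** 4 + z ** 4
            + x ** 3 * y / 20.0 - x * y * z / 20.0 - y * z / 100.0 + z ** 2 / 50.0 - 1.0)


def estimate_volume(num_samples: int = 200_000, half_width: float = 1.05, seed: int = 0) -> float:
    """Estimate vol{f <= 0} by uniform sampling in the bounding cube."""
    rng = random.Random(seed)  # fixed seed so that the run is reproducible
    hits = 0
    for _ in range(num_samples):
        x = rng.uniform(-half_width, half_width)
        y = rng.uniform(-half_width, half_width)
        z = rng.uniform(-half_width, half_width)
        if f(x, y, z) <= 0.0:
            hits += 1
    cube_volume = (2.0 * half_width) ** 3
    return cube_volume * hits / num_samples


if __name__ == "__main__":
    print(estimate_volume())
```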

Let $\mathrm{pr}\colon \mathbb{R}^3 \to \mathbb{R}$ be the projection onto the x-coordinate. Let $v(x) = \mathrm{vol}_2(\mathrm{pr}^{-1}(x) \cap S)$
denote the area of the fiber over any point x in $\mathbb{R}$. We write $e_1 < e_2$ for the two branch
points of the map pr restricted to the quartic surface {f = 0}. They can be computed with
resultants. The projection has 36 complex branch points. The first two of them are real and
therefore are the branch points of pr. We obtain $e_1 \approx -1.0023512$ and $e_2 \approx 1.0024985$.
By [116, Theorem 9], the area function v(x) is a period of the rational integral


$$\frac{1}{2\pi i} \oint \frac{y}{f(x,y,z)}\,\frac{\partial f(x,y,z)}{\partial y}\,dy\,dz.$$


We set $w(t) = \int_{e_1}^{t} v(x)\,dx$. The desired 3-dimensional volume equals $\mathrm{vol}(S) = w(e_2)$.
Using the function periods in MAGMA [24], we compute a differential operator P of
order eight that annihilates v(x). Again, $P\partial$ then annihilates w(x). One initial condition
is $w(e_1) = 0$. We obtain eight further initial conditions $w'(x) = \mathrm{vol}_2(S_x)$ for points
$x \in (e_1, e_2)$ by running the same algorithm for the 2-dimensional semialgebraic slices
$S_x = \mathrm{pr}^{-1}(x) \cap S$. In other words, we make eight subroutine calls to an area measurement
as in Example 14.9. From these nine initial conditions, we derive linear relations of the
coefficients in the local basis expansion at $e_2$. These computations are run in SAGE [171]
as described in steps (i), (ii), (iii) and (iv) above. We find the approximate volume to be


vol(S) ≈ 6.438832480572893544740733895969956188958420889235116976266328923128826
9155273887642162091495583989038294311376088934526903525560097601024171
190804769405534826558114212766135380613959757935305271022089419155701
52158647017087400219438452914068685622775954171509711339913473405961
7632892206072085516332397969163383760070738760107318247752061504714
367250460900923409066377732273390396822296235214963623286613117557
930687544148360721225681053481178760058264738867105810326818911
578448323758536767168707442532146029753762594261578920477859.


All digits in this number are guaranteed to be accurate. For details see [116, Section 4]. ⋄


14.3 SDP Hierarchies 


We now present our second method for computing volumes. It is based on a hierarchy 
of semidefinite programs (SDP). This is due to Lasserre and his collaborators. See the 
articles [86, 169, 170] and references therein.

As before, the goal is to compute the volume of a compact semialgebraic set $S \subset \mathbb{R}^n$
as accurately as possible. We consider an inclusion of semialgebraic sets $S \subset B \subset \mathbb{R}^n$,
where B is compact and serves as a bounding box, like $B = [-1,1]^n$.

We assume that the moments of the Lebesgue measure on B are known or easy to
compute, i.e., we have access to the values of the integrals


$$b_\alpha = \int_B x^\alpha\,dx = \int_B x_1^{\alpha_1} \cdots x_n^{\alpha_n}\,dx_1\,dx_2 \cdots dx_n \quad \text{for } \alpha \in \mathbb{N}^n.$$

However, the moments $m_\alpha$ of the Lebesgue measure on S are unknown:

$$m_\alpha = \int_S x^\alpha\,dx = \int_S x_1^{\alpha_1} \cdots x_n^{\alpha_n}\,dx_1\,dx_2 \cdots dx_n \quad \text{for } \alpha \in \mathbb{N}^n. \tag{14.13}$$
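
For the box $B = [-1,1]^n$ the moments $b_\alpha$ have a simple closed form, since the integral factors into univariate integrals $\int_{-1}^{1} t^{a}\,dt$, which equal $2/(a+1)$ for even a and 0 for odd a. The Python sketch below implements this. The function name is our own, and other bounding boxes would need a different formula.

```python
from fractions import Fraction


def box_moment(alpha) -> Fraction:
    """Moment b_alpha of the Lebesgue measure on the box [-1, 1]^n, with n = len(alpha)."""
    value = Fraction(1)
    for a in alpha:
        # Integral of t^a over [-1, 1]: zero for odd a, 2/(a+1) for even a.
        value *= Fraction(2, a + 1) if a % 2 == 0 else Fraction(0)
    return value


if __name__ == "__main__":
    print(box_moment((0, 0, 0)))  # volume of the cube [-1, 1]^3
    print(box_moment((2, 0)))
    print(box_moment((1, 2)))
```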


These moments will be our decision variables. Our aim is to compute $m_0 = \mathrm{vol}(S)$. The
idea is to use the following infinite-dimensional linear program:


Maximize the integral $\int d\mu$, where $\mu$ and $\hat\mu$ range over measures on $\mathbb{R}^n$, where $\mu$ is
supported on S, $\hat\mu$ is supported on B, and the sum $\mu + \hat\mu$ is the Lebesgue measure on B.


The unique optimal solution $(\mu^*, \hat\mu^*)$ to this linear program can be characterized as
follows: $\mu^*$ is the Lebesgue measure on S, $\hat\mu^*$ is the Lebesgue measure on $B \setminus S$, and the
optimal value is $\mathrm{vol}(S) = \int d\mu^*$. This is described in [170, Equation (1)]. The linear
programming (LP) dual is given in [170, Equation (2)].

We state our linear program in terms of the moment sequences $m = (m_\alpha)$ and $\hat m = (\hat m_\alpha)$
of the two unknown measures $\mu$ and $\hat\mu$. Namely, we paraphrase our problem as follows:


Maximize $m_0$ subject to $m_\alpha + \hat m_\alpha = b_\alpha$ for all $\alpha \in \mathbb{N}^n$,
where m and $\hat m$ are the moment sequences of $\mu$ and $\hat\mu$,        (14.14)
respectively, with $\mu$ supported on S and $\hat\mu$ supported on B.


We arrive at the moment problem, which is the question of how to characterize moments 
of measures. This problem has a long history in mathematics, and an exact characteriza- 
tion is very difficult. However, in recent years, it has been realized that there are effective 
necessary conditions. These involve semidefinite programming formulations in finite di- 
mensions, which are built via the localizing matrices we now define. 

In the following, we assume for convenience that S is defined by a single inequality:


$$S = \{x \in \mathbb{R}^n \mid f(x) \geq 0\}, \quad \text{where } f(x) = \sum_{\alpha} c_\alpha x^\alpha. \tag{14.15}$$


The theory works for all semialgebraic sets (14.2). Fix an integer d that exceeds the degree
of f. We shall construct three symmetric matrices of format $\binom{n+d}{d} \times \binom{n+d}{d}$ whose entries
are linear in the decision variables. The rows and columns of our matrices are indexed by
elements $\alpha \in \mathbb{N}^n$ with $|\alpha| = \alpha_1 + \cdots + \alpha_n$ at most d. These correspond to monomials $x^\alpha$
of degree $\leq d$. Our first matrix $M_d(m)$ has the entry $m_{\alpha+\beta}$ in row $\alpha$ and column $\beta$. Our
second matrix $M_d(\hat m)$ has the entry $\hat m_{\alpha+\beta}$ in row $\alpha$ and column $\beta$. Finally, suppose that the
polynomial f defining S has coefficients $c_\gamma$. Then, our third matrix $M_d(f\,m)$ has the entry
$\sum_\gamma c_\gamma\, m_{\alpha+\beta+\gamma}$ in row $\alpha$ and column $\beta$. We consider the following semidefinite program:


Maximize $m_0$ subject to $m_\alpha + \hat m_\alpha = b_\alpha$
for all $\alpha \in \mathbb{N}^n$ with $|\alpha| \leq d$, where the
symmetric matrices $M_d(m)$, $M_d(\hat m)$        (14.16)
and $M_d(f\,m)$ are positive semidefinite.


The third matrix can be replaced by $M_{d'}(f\,m)$ where $d' = d - \lceil \deg(f)/2 \rceil$. The objective
function value depends on d, and it decreases as d increases. The limit for $d \to \infty$ equals
the volume of S. Indeed, this sequence of SDP problems is an approximation to (14.14).
The convergence property was proved in [86].
The remainder of this section shows how to solve (14.16) in practice. It is based on 
[86, 169, 170]. We discuss an implementation in Mathematica [102]. This material was 
developed by Chiara Meroni. We are very grateful to her for allowing us to include it here. 

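Our point of departure is the following question: given a sequence of real numbers
$m = (m_\alpha)_\alpha$, does there exist a measure $\mu_S$ supported on the set S such that (14.13) holds?
Given $d \in \mathbb{N}$, let $\mathbb{N}^n_d$ be the set of $\alpha \in \mathbb{N}^n$ such that $|\alpha| = \alpha_1 + \cdots + \alpha_n \leq d$. We also set


$$r = \left\lceil \frac{\deg f}{2} \right\rceil.$$

The moment matrix and the localizing matrix for our sequence of moments m are


$$M_d(m) = \bigl( m_{\alpha+\beta} \bigr)_{\alpha,\beta \in \mathbb{N}^n_d} \quad \text{and} \quad M_{d-r}(f\,m) = \Bigl( \sum_{\gamma} c_\gamma\, m_{\alpha+\beta+\gamma} \Bigr)_{\alpha,\beta \in \mathbb{N}^n_{d-r}}. \tag{14.17}$$


The moment matrix has size $\binom{n+d}{d} \times \binom{n+d}{d}$. The localizing matrix has size
$\binom{n+d-r}{d-r} \times \binom{n+d-r}{d-r}$.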


Proposition 14.18 A necessary condition for a sequence $m = (m_\alpha)_\alpha$ to have a representing
measure supported on S is that for every $d \in \mathbb{N}$ the following matrix inequalities hold:


$$M_d(m) \succeq 0 \quad \text{and} \quad M_{d-r}(f\,m) \succeq 0.$$


This result is a formulation of Putinar’s Positivstellensatz [86, Theorem 2.2]. The positive
semi-definiteness of the moment matrix is a necessary condition for m to have a representing
measure; the inequality with the localizing matrix forces the support of the representing
measure to be contained in $S = \{f(x) \geq 0\}$.


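Example 14.19 Consider the disc $S = \{(x,y) \in \mathbb{R}^2 \mid f = 1 - x^2 - y^2 \ge 0\}$. Its moments are

$$ m_{(\alpha_1,\alpha_2)} \,=\, \bigl((-1)^{\alpha_1} + 1\bigr)\bigl((-1)^{\alpha_2} + 1\bigr)\, \frac{\Gamma\bigl(\tfrac{\alpha_1+1}{2}\bigr)\,\Gamma\bigl(\tfrac{\alpha_2+1}{2}\bigr)}{4\,\Gamma\bigl(\tfrac{1}{2}(\alpha_1+\alpha_2+4)\bigr)}, $$

where $\Gamma$ is the Gamma function. For $d = 3$, the moment and localizing matrices (14.17) are

$$ M_3(m) \,=\, \pi \begin{pmatrix}
1 & 0 & \frac14 & 0 & 0 & 0 & 0 & \frac14 & 0 & 0 \\
0 & \frac14 & 0 & \frac18 & 0 & 0 & 0 & 0 & \frac1{24} & 0 \\
\frac14 & 0 & \frac18 & 0 & 0 & 0 & 0 & \frac1{24} & 0 & 0 \\
0 & \frac18 & 0 & \frac5{64} & 0 & 0 & 0 & 0 & \frac1{64} & 0 \\
0 & 0 & 0 & 0 & \frac14 & 0 & \frac1{24} & 0 & 0 & \frac18 \\
0 & 0 & 0 & 0 & 0 & \frac1{24} & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & \frac1{24} & 0 & \frac1{64} & 0 & 0 & \frac1{64} \\
\frac14 & 0 & \frac1{24} & 0 & 0 & 0 & 0 & \frac18 & 0 & 0 \\
0 & \frac1{24} & 0 & \frac1{64} & 0 & 0 & 0 & 0 & \frac1{64} & 0 \\
0 & 0 & 0 & 0 & \frac18 & 0 & \frac1{64} & 0 & 0 & \frac5{64}
\end{pmatrix}
\quad\text{and}\quad
M_2(fm) \,=\, \pi \begin{pmatrix}
\frac12 & 0 & \frac1{12} & 0 & 0 & \frac1{12} \\
0 & \frac1{12} & 0 & 0 & 0 & 0 \\
\frac1{12} & 0 & \frac1{32} & 0 & 0 & \frac1{96} \\
0 & 0 & 0 & \frac1{12} & 0 & 0 \\
0 & 0 & 0 & 0 & \frac1{96} & 0 \\
\frac1{12} & 0 & \frac1{96} & 0 & 0 & \frac1{32}
\end{pmatrix}. $$

These two matrices are symmetric and positive definite. ⋄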
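The matrices of Example 14.19 can be recomputed directly from the Gamma-function formula. The following Python sketch does this with the standard library only. The function names, the monomial ordering 1, x, x², x³, y, xy, x²y, y², xy², y³, and the hand-written Cholesky test for positive definiteness are our own choices for illustration, not part of the Mathematica implementation discussed in this section.

```python
import math

def disc_moment(a1, a2):
    """Moment of x^a1 * y^a2 over the unit disc, via the Gamma-function formula."""
    sign = ((-1) ** a1 + 1) * ((-1) ** a2 + 1)   # vanishes unless both exponents are even
    if sign == 0:
        return 0.0
    return sign * math.gamma((a1 + 1) / 2) * math.gamma((a2 + 1) / 2) / (
        4 * math.gamma((a1 + a2 + 4) / 2))

def exponents(d):
    """Exponent vectors (a1, a2) with a1 + a2 <= d, ordered 1, x, x^2, ..., y, xy, ..."""
    return [(a1, a2) for a2 in range(d + 1) for a1 in range(d + 1 - a2)]

def moment_matrix(d):
    """Moment matrix M_d(m) with entry m_{alpha+beta} in row alpha, column beta."""
    idx = exponents(d)
    return [[disc_moment(a[0] + b[0], a[1] + b[1]) for b in idx] for a in idx]

def localizing_matrix(d):
    """Localizing matrix M_d(fm) for f = 1 - x^2 - y^2."""
    def entry(g1, g2):
        return disc_moment(g1, g2) - disc_moment(g1 + 2, g2) - disc_moment(g1, g2 + 2)
    idx = exponents(d)
    return [[entry(a[0] + b[0], a[1] + b[1]) for b in idx] for a in idx]

def is_positive_definite(M, tol=1e-12):
    """Run a Cholesky factorization; it succeeds exactly when M is positive definite."""
    n = len(M)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = M[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                if s <= tol:
                    return False
                L[i][i] = math.sqrt(s)
            else:
                L[i][j] = s / L[j][j]
    return True

M3 = moment_matrix(3)       # 10 x 10
F2 = localizing_matrix(2)   # 6 x 6, since d - r = 3 - 1 = 2
print(len(M3), len(F2), is_positive_definite(M3), is_positive_definite(F2))
```

The Cholesky test is only a numerical check with a small tolerance; it is adequate here because the entries are far from degenerate.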


We consider the infinite-dimensional linear program (LP) on measures whose optimal
value is the volume of $S \subset B$. The program was stated above. We use the formulation
in [86, Equation 3.1] and [170, Equation 1]:

$$ P: \quad \sup_{\mu_S,\,\mu_{B\setminus S}} \int d\mu_S, \quad \text{s.t.} \quad \mu_S + \mu_{B\setminus S} = \mu_B. \qquad (14.18) $$

Here, $\mu_S$ is a positive finite Borel measure supported on $S$, and $\mu_B$ is the Lebesgue measure
on $B$. The adjective “infinite-dimensional” refers to the fact that we are optimizing over a
set of measures, which is uncountable. Based on the theory of dual Banach spaces, one can
talk about dual convex bodies, and construct a duality theory for infinite-dimensional LPs.
In our case, the dual to the space of positive finite Borel measures is the set of positive
continuous functions. This observation leads to the definition of an LP that is dual to $P$:

$$ P^*: \quad \inf_{\gamma} \int \gamma \, d\mu_B, \quad \text{s.t.} \quad \gamma \ge \mathbb{1}_S. \qquad (14.19) $$

The decision variable $\gamma$ is a positive continuous function on $B$ and $\mathbb{1}_S$ is the indicator
function of $S$. There is no duality gap: the optimal values of (14.18) and (14.19) coincide.
Note that the optimal value of $P^*$ is an infimum and not a minimum since we approximate
the discontinuous indicator function $\mathbb{1}_S$ using continuous functions.

The infinite-dimensional LP (14.18) can be approximated by a hierarchy of finite- 
dimensional semidefinite programs (SDPs) [117]. The optimal values of the hierarchy 
converge monotonically to the optimal value of the LP [86, Theorem 3.2]. There is again a 
primal and dual version of the SDPs. In our setting, the primal is 


$$ P_d: \quad \max_{m,\,\hat{m}} \; m_0, \quad \text{s.t.} \quad m + \hat{m} = b, \;\; M_d(m) \succeq 0, \;\; M_d(\hat{m}) \succeq 0, \;\; M_{d-r}(fm) \succeq 0. \qquad (14.20) $$

Here $m = (m_\alpha)$, $\hat{m} = (\hat{m}_\alpha)$, and $b = (b_\alpha)$ contains the moments of $B$ indexed by $\alpha \in \mathbb{N}^n_{2d}$.
This formulation is [170, Equation 3]. The optimal value of $P_d$ is an upper bound for
vol(S) since we are optimizing over a larger set. The dual SDP is [86, Equation 3.6],
which is formulated using sums of squares of polynomials. The authors of [86, 169, 170]
implemented the SDPs using GloptiPoly [85]. The next examples are computed with
Mathematica [102]. We include the linear condition $m + \hat{m} = b$ in the
condition on the moment matrix of $\hat{m}$, by imposing directly that $M_d(b - m) \succeq 0$.

Example 14.20 (TV screen) Consider $S = \{(x,y) \in [-1.2, 1.2]^2 \mid f(x,y) \ge 0\} \subset \mathbb{R}^2$,
where $f = -q$ is the quartic in (14.6). This convex set is shown in Figures 14.2 and 14.4.
Recall from Example 14.9 that vol(S) = 3.7081599447....

Let us now try the SDP formulation above, with $d = 10$. The moment matrices $M_{10}(m)$
and $M_{10}(b - m)$ have format 66 × 66. For instance, the second matrix looks like

$$ M_{10}(b - m) \,=\, \begin{pmatrix}
\frac{144}{25} - m_{(0,0)} & -m_{(0,1)} & \frac{1728}{625} - m_{(0,2)} & -m_{(0,3)} & \cdots \\
-m_{(0,1)} & \frac{1728}{625} - m_{(0,2)} & -m_{(0,3)} & \frac{186624}{78125} - m_{(0,4)} & \cdots \\
\frac{1728}{625} - m_{(0,2)} & -m_{(0,3)} & \frac{186624}{78125} - m_{(0,4)} & -m_{(0,5)} & \cdots \\
-m_{(0,3)} & \frac{186624}{78125} - m_{(0,4)} & -m_{(0,5)} & \frac{6718464}{2734375} - m_{(0,6)} & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{pmatrix}. $$

The localizing matrix $M_8(fm)$ has format 45 × 45. Its $(\alpha, \beta)$ entry equals

$$ m_{\alpha+\beta} \,-\, m_{(4,0)+\alpha+\beta} \,-\, m_{(0,4)+\alpha+\beta} \,-\, \tfrac{1}{100}\, m_{(1,1)+\alpha+\beta}. $$

The optimal value of the semidefinite program $P_{10}$ is 4.4644647361..., the optimal value
of $P_{14}$ is 4.3679560947..., and for $P_{18}$ we get 4.3241824171.... These numbers are upper
bounds for the actual volume, as predicted. However, these bounds are far from the truth. ⋄
Example 14.21 (Elliptope) Consider $f(x,y,z) = 1 - x^2 - y^2 - z^2 + 2xyz$. This defines the
elliptope $S = \{(x,y,z) \in [-1,1]^3 \mid f(x,y,z) \ge 0\} \subset \mathbb{R}^3$, shown in Figures 14.1 and 14.4.
We already know from Example 14.1 that vol(S) = $\pi^2/2$ = 4.934802202.... The upper
bounds computed from the SDP (14.20) for d = 4, 8, 12 are respectively 7.3254012963...,
6.6182632506..., and 6.303035372.... These numbers are still pretty bad. ⋄


Fig. 14.4: Left: the TV screen from Example 14.20. Right: the elliptope from Example 14.21.


Examples 14.20 and 14.21 suggest that the convergence of the SDP approximation is
quite slow. To improve this, one uses the method of Stokes constraints, which was introduced
in [169, 170]. We shall now explain it. In the linear program $P^*$ (14.19) and in the SDP
hierarchy (14.20), we aim to approximate a piecewise-differentiable function $\mathbb{1}_S$ with
continuous functions (respectively, polynomials). This produces the well-known Gibbs effect,
creating many oscillations near the boundary of $S$ in the polynomial solutions of the SDP.

To remedy this, we add linear constraints that do not modify the infinite-dimensional LP
problem but add more information to the finite-dimensional SDP. One concrete way to do
this uses Stokes’ theorem and the fact that $f$ vanishes on the boundary $\partial S$ of $S = \{f \ge 0\}$.
Let $U$ be an open region in $\mathbb{R}^n$ such that the Euclidean closure of $U$ is our semialgebraic
set $S$. Since $\partial S$ is the zero set of a polynomial, it is smooth almost everywhere. The
classical theorem of Stokes applies. This theorem states that $\int_{\partial S} \omega = \int_S d\omega$ for any
$(n-1)$-differential form $\omega$ on $\mathbb{R}^n$. One consequence of Stokes’ theorem is Gauss’ formula

$$ \int_{\partial S} V(x) \cdot N(x) \, dx \,=\, \int_S \operatorname{div} V(x) \, dx. $$

Here, $V(x)$ is a vector field, div denotes the divergence operator, $N(x)$ is the unit normal
field of the algebraic variety $\partial S$ that points outwards, and we are integrating against the
standard measures on the respective domains. If the vector field is a scalar field times a
constant vector $c \in \mathbb{R}^n$, say $V(x) = v(x)\,c$, and if $\nabla v(x)$ denotes the gradient of $v$ at $x$, then

$$ c \cdot \int_{\partial S} v(x)\, N(x) \, dx \,=\, \int_S \operatorname{div}\bigl(v(x)\,c\bigr) \, dx \,=\, c \cdot \int_S \nabla v(x) \, dx. $$

This formula holds because $\operatorname{div}(v(x)\,c) = \nabla v(x) \cdot c + v(x) \cdot \operatorname{div}(c)$ and the divergence of a
constant vector is zero. Since this identity holds for every $c \in \mathbb{R}^n$, we have

$$ \int_{\partial S} v(x)\, N(x) \, dx \,=\, \int_S \nabla v(x) \, dx. \qquad (14.21) $$

If $v = 0$ on $\partial S$, then the left-hand side of (14.21) is zero. This can be expressed in terms of
measures and distributions, and added to (14.18) and (14.19) as in [170, Equation 17 and
Remark 3]. In our SDP hierarchy, the Stokes constraints are written as follows. Let

$$ v(x) \,=\, f(x)\, x^\alpha $$

for any multiindex $\alpha \in \mathbb{N}^n$ with $|\alpha| \le d + 1 - \deg f$. To remedy the Gibbs effect, we require

$$ \nabla\bigl(f(x)\, x^\alpha\bigr)\Big|_{x^\beta \mapsto m_\beta} \,=\, 0, $$

that is, after expanding the gradient, every monomial $x^\beta$ is replaced by the moment $m_\beta$.
This yields $n$ new linear conditions for each $\alpha$ as above.
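As a quick sanity check of such a constraint, here is a small worked case of our own, using the disc of Example 14.19 rather than the sets treated in the examples that follow. We take $f = 1 - x^2 - y^2$ and $\alpha = (1,0)$, and we use the known moments $m_{(0,0)} = \pi$, $m_{(2,0)} = m_{(0,2)} = \pi/4$ and $m_{(1,1)} = 0$ of the disc.

```latex
\begin{align*}
\frac{\partial}{\partial x}\bigl(f(x,y)\,x\bigr) &= 1 - 3x^2 - y^2
  &&\Longrightarrow\quad m_{(0,0)} - 3\,m_{(2,0)} - m_{(0,2)}
     = \pi - \tfrac{3\pi}{4} - \tfrac{\pi}{4} = 0, \\
\frac{\partial}{\partial y}\bigl(f(x,y)\,x\bigr) &= -2xy
  &&\Longrightarrow\quad -2\,m_{(1,1)} = 0 .
\end{align*}
```

Both conditions hold for the true moments of the disc, as (14.21) predicts, since $v = f\,x$ vanishes on the boundary circle.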
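Example 14.22 For the SDP in Example 14.20, the Stokes constraints for a given $\alpha$ are:

$$ \alpha_1 m_{\alpha+(-1,0)} - (\alpha_1 + 4)\, m_{\alpha+(3,0)} - \alpha_1 m_{\alpha+(-1,4)} - \tfrac{\alpha_1+1}{100}\, m_{\alpha+(0,1)} = 0, $$
$$ \alpha_2 m_{\alpha+(0,-1)} - \alpha_2 m_{\alpha+(4,-1)} - (\alpha_2 + 4)\, m_{\alpha+(0,3)} - \tfrac{\alpha_2+1}{100}\, m_{\alpha+(1,0)} = 0. $$

For the SDP in Example 14.21, the Stokes constraints are:

$$ \alpha_1 m_{\alpha+(-1,0,0)} - (\alpha_1+2)\, m_{\alpha+(1,0,0)} - \alpha_1 m_{\alpha+(-1,2,0)} - \alpha_1 m_{\alpha+(-1,0,2)} + 2(\alpha_1+1)\, m_{\alpha+(0,1,1)} = 0, $$
$$ \alpha_2 m_{\alpha+(0,-1,0)} - \alpha_2 m_{\alpha+(2,-1,0)} - (\alpha_2+2)\, m_{\alpha+(0,1,0)} - \alpha_2 m_{\alpha+(0,-1,2)} + 2(\alpha_2+1)\, m_{\alpha+(1,0,1)} = 0, $$
$$ \alpha_3 m_{\alpha+(0,0,-1)} - \alpha_3 m_{\alpha+(2,0,-1)} - \alpha_3 m_{\alpha+(0,2,-1)} - (\alpha_3+2)\, m_{\alpha+(0,0,1)} + 2(\alpha_3+1)\, m_{\alpha+(1,1,0)} = 0. $$

Table 14.1 compares the optimal values in (14.20) with and without Stokes constraints. ⋄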
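Constraints of this kind can be generated mechanically by differentiating $f(x)\,x^\alpha$ and reading off the coefficients. The Python sketch below does this for polynomials stored as dictionaries from exponent tuples to coefficients. The function name `stokes_constraints` and the data layout are hypothetical, and the TV screen polynomial $f = 1 - x^4 - y^4 - \frac{1}{100}xy$ is the one read off from the localizing-matrix entry in Example 14.20.

```python
from fractions import Fraction

def stokes_constraints(f, alpha):
    """For each variable x_i, return the coefficients of d/dx_i (f * x^alpha).

    f maps exponent tuples to coefficients. Each returned dict {gamma: c}
    stands for the linear condition  sum_gamma c * m_gamma = 0  on the moments.
    """
    n = len(alpha)
    # Multiply f by the monomial x^alpha: shift every exponent by alpha.
    g = {tuple(e + a for e, a in zip(exp, alpha)): c for exp, c in f.items()}
    constraints = []
    for i in range(n):
        deriv = {}
        for exp, c in g.items():
            if exp[i] == 0:
                continue  # the monomial does not involve x_i
            lowered = tuple(e - 1 if j == i else e for j, e in enumerate(exp))
            deriv[lowered] = deriv.get(lowered, 0) + c * exp[i]
        constraints.append(deriv)
    return constraints

# TV screen: f = 1 - x^4 - y^4 - xy/100
tv = {(0, 0): Fraction(1), (4, 0): Fraction(-1),
      (0, 4): Fraction(-1), (1, 1): Fraction(-1, 100)}
cx, cy = stokes_constraints(tv, (1, 0))
print(cx)  # coefficients of m_(0,0), m_(4,0), m_(0,4), m_(1,1)
print(cy)  # coefficients of m_(1,3), m_(2,0)
```

For $\alpha = (1,0)$ the first dictionary reproduces the first displayed constraint with $\alpha_1 = 1$, and the second reproduces the second constraint with $\alpha_2 = 0$, where the terms with coefficient $\alpha_2$ drop out.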


As Table 14.1 shows, the convergence with Stokes constraints is much faster than without
them. The intuition is that now, with the (dual) Stokes constraints added to $P^*$ (14.19), the
function we approximate is not just the indicator function of $S$. A detailed explanation,
for a variant of the Stokes constraints, is given in [169]. The authors prove that, when
adding these constraints, the optimal solution of the new $P^*$ becomes a minimum. This
eliminates any kind of Gibbs effect, and guarantees faster convergence. In [169], the authors
mention that, from numerical experiments, it is reasonable to expect that the original Stokes
constraints and the new Stokes constraints are equivalent, but there is no formal proof yet.


                                   without Stokes                with Stokes
S           volume         d     max P_d        time          max P_d        time
TV screen   3.708159...   10     4.464464...    0.621093      3.709994...    0.482376
                          15     4.367956...    3.545369      3.708191...    3.738137
                          20     4.324182...   14.906281      3.708163...   20.592531
elliptope   4.934802...    4     7.325401...    0.124392      5.612716...    0.077315
                           8     6.618263...    7.222441      4.976796...    7.178571
                          12     6.303035...  696.886298      4.937648... 1105.619231

Table 14.1: The optimal values of (14.20) with and without Stokes constraints for Examples 14.20 and
14.21. The column “max P_d” displays the optimal value, whereas the column “time” gives the time, in
seconds, for running the command SemidefiniteOptimization in Mathematica [102].

In this chapter, we considered semialgebraic sets (14.1) that are defined by only one
polynomial inequality. This restriction was imposed to simplify the exposition. Two methods
for computing their volumes were presented in Sections 14.2 and 14.3. We did not
offer a comparison between them: this is a topic for future research. Both methods apply to
arbitrary semialgebraic sets. These can be written as finite unions of basic semialgebraic
sets (14.2). In conclusion, computing volumes is important for metric algebraic geometry.


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International

Chapter 15
Sampling


In this book, we studied the metric geometry of algebraic varieties from an applied and 
computational perspective. We now add probability theory into this mix. This chapter is 
about sampling from a real variety X. Asmussen and Glynn [10] underline the importance 
of sampling as follows: “Sampling-based computational methods are a fundamental part 
of the numerical toolset across an enormous number of different applied domains”. 
Section 15.1 connects us to topological data analysis. Here we explain how sampling 
can be used to compute topological information about X. In Section 15.2, we discuss 
algorithms for sampling with density guarantees. The workhorse behind this is the ability 
to rapidly solve polynomial equations associated with X, namely intersecting with linear 
subspaces and finding ED critical points. In Section 15.3, we introduce Markov kernels on 
the variety X. These specify Markov chains that take steps on the variety X. While Markov 
chains on discrete state spaces are familiar in combinatorics and algebra, now the state 
space X is continuous. This requires some analysis that may be unfamiliar to our readers. 
Markov chains on X can be modified with the Metropolis–Hastings algorithm (Algorithm 6),
in order to reach a desired stationary distribution. For instance, if X is compact, we
could be interested in sampling from the uniform distribution. We will explain how to do
this in Section 15.4. The name Chow in the section title is an allusion to Chow forms. These
encode a variety X through its linear sections. In our probabilistic setting, distributions on
X are represented by distributions on the Grassmannian. This can be viewed as an interpretation
of the Chow form for random algebraic geometry. For the resulting class of sampling
algorithms, we propose the acronym CMCMC, short for Chow Markov Chain Monte Carlo.


15.1 Homology from Finite Samples 


Let X be a real algebraic variety in $\mathbb{R}^n$. Our aim is to compute a finite subset $S \subset X$, which
we call a sample. A sample on a curve is shown in Figure 15.1. The sample S yields a
discrete approximation of X that can be used to explore properties of the variety.
For instance, if $g : X \to \mathbb{R}$ is a function, we can find a lower bound for the optimization
problem $\max_{x \in X} g(x)$ by computing $\max_{s \in S} g(s)$. Or, assuming that X has finite volume,
we can estimate the average value $\frac{1}{\mathrm{vol}(X)} \int_{x \in X} g(x)\,dx$ by the finite average $\frac{1}{|S|} \sum_{s \in S} g(s)$.


Fig. 15.1: A sample of points on the curve $X = \{x^4 + y^4 - 2x^2 - 2xy^2 - y + 1 = 0\}$.


In what follows, we use the Euclidean distance $d(x,y) = \|x - y\|$ on the ambient
space $\mathbb{R}^n$. This means that we consider our variety X with an extrinsic metric. The next
definition refers to this extrinsic metric on the variety X.


Definition 15.1 A finite subset S of X is called ε-dense in X if, for all $x \in X$, there exists
an $s \in S$ with $d(x,s) < \varepsilon$. If this holds, then we also say that S is an ε-sample for X.
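Definition 15.1 can be tested numerically when a fine reference discretization of X is available. The sketch below does this for the unit circle in Python. The helper names are hypothetical, and the check is only a heuristic: it tests density against finitely many reference points, not against all of X.

```python
import math

def circle_points(k):
    """k equally spaced points on the unit circle."""
    return [(math.cos(2 * math.pi * j / k), math.sin(2 * math.pi * j / k))
            for j in range(k)]

def covering_radius(reference, sample):
    """Largest distance from a reference point to its nearest sample point."""
    return max(min(math.dist(x, s) for s in sample) for x in reference)

def is_eps_dense(reference, sample, eps):
    """Check d(x, s) < eps for every reference point x and some sample point s."""
    return covering_radius(reference, sample) < eps

reference = circle_points(2000)   # stand-in for "all x in X"
sample = circle_points(20)        # candidate sample S
print(covering_radius(reference, sample))
print(is_eps_dense(reference, sample, 0.2), is_eps_dense(reference, sample, 0.1))
```

For 20 equally spaced points the worst case is the midpoint of an arc between two neighbors, at chord distance $2\sin(\pi/40) \approx 0.157$, so this sample is 0.2-dense but not 0.1-dense.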


In this section, we are computing random samples on an algebraic variety X. In this 
computation, we can replace X by its smooth locus Reg(X) and sample from the manifold 
Reg(X). To keep the notation simple, we assume throughout this chapter that the variety 
X is smooth. It is a manifold that is embedded in the Euclidean space $\mathbb{R}^n$.

We first recall a theorem due to Niyogi, Smale, and Weinberger [138]. This gives
conditions for when the homology of X can be computed from a finite sample S. The idea
is to compute the homology of the union of ε-balls $U = \bigcup_{s \in S} B_\varepsilon(s)$ of an ε-sample S.
The homology groups of a union of balls U can be computed from the associated Čech
complex. The theorem explains how small ε must be for this to work.


Theorem 15.2 Let $\varepsilon > 0$, let S be an ε-sample for X, and write $U = \bigcup_{s \in S} B_\varepsilon(s)$ for the
union of all ε-balls around all points in S. If $\varepsilon < \sqrt{3/5}\;\tau(X)$, where $\tau(X)$ is the reach, then
X is a deformation retract of U. In this case, the homology of X equals the homology of U.


Proof See [138, Proposition 3.1]. □


Di Rocco, Eklund, and Gäfvert [56] showed that, in Theorem 15.2, the reach $\tau(X)$ can
be replaced by the local reach $\tau(x)$. For $x \in X$, the local reach is defined as the quantity

$$ \tau(x) \,:=\, \sup \{ r > 0 \mid \text{every } u \in B_r(x) \text{ has a unique closest point on } X \}. \qquad (15.1) $$

Thus, the local reach $\tau(x)$ is the distance from the point $x \in X$ to the medial axis Med(X);
cf. Chapter 7. The global reach $\tau(X)$ is bounded above by the local reach at each point; in
fact, we have $\tau(X) = \inf_{x \in X} \tau(x)$. Thus, using the local reach instead of the global reach for
upper bounding ε can make a significant difference. The result from [56] is: If $\varepsilon < \frac{4}{5}\,\tau(s)$
for all points s in an ε-sample $S \subset X$, then X is a deformation retract of $U = \bigcup_{s \in S} B_\varepsilon(s)$.

If we only seek homology for small dimensions, then we can relax the conditions on ε.
We first need the notion of k-bottlenecks. For a finite subset $B \subset X$, let $\Pi(B)$ denote the
subset of points in $\mathbb{R}^n$ that are equidistant to all points in B. This is an affine-linear space.


Definition 15.3 Let $B = \{x_1, \ldots, x_k\} \subset X$ for $k \ge 2$. Let $L_i := x_i + N_{x_i} X$ denote the
affine normal space of X at $x_i$. If $L_1 \cap \cdots \cap L_k \cap \Pi(B) \cap \mathrm{conv}(B) \ne \emptyset$, we say that B is a
k-bottleneck of X. The width of B is

$$ \ell(B) \,:=\, \inf_{u \in \Pi(B)} d(x_1, u). $$

Intuitively, k-bottlenecks identify data points $u \in \mathbb{R}^n$ that have precisely k nearest points
on X. The convexity condition implies that u is in the “middle” of the $x_i$, and not outside.
For instance, the 2-bottlenecks of X are precisely the bottlenecks studied in Section 7.1.
The weak feature size wfs(X) of the variety X is the smallest width of a k-bottleneck:

$$ \mathrm{wfs}(X) \,:=\, \min_{2 \le k \le \mathrm{EDdegree}(X)} \; \inf \{\, \ell(B) \mid B \text{ is a } k\text{-bottleneck of } X \,\}. $$

The weak feature size is always greater than or equal to the reach. The reason is that
the exponential map $\varphi_\varepsilon$ in (6.12) cannot be injective when $\varepsilon > \ell(B)$ for a k-bottleneck
$B = \{x_1, \ldots, x_k\}$. Indeed, if this holds, then the point $u \in \Pi(B)$ with $\ell(B) = d(x_1, u)$ has k
points $(x_1, v_1), \ldots, (x_k, v_k)$ in its fiber under $\varphi_\varepsilon$.

For the next theorem, we fix $\varepsilon > 0$ and a finite set $S \subset \mathbb{R}^n$. We define a two-dimensional
simplicial complex C with vertex set S as follows. For $x, y \in S$, the pair $\{x, y\}$ is an edge
of C if and only if at least one of the following two metric conditions is satisfied:


1. $d(x,y) < 2\varepsilon$, or
2. $d(x,y) < \sqrt{8}\,\varepsilon$ and there is $z \in S$ with $d(x,z) < 2\varepsilon$ and $d(y,z) < 2\varepsilon$.


We add the triangle with vertices $x, y, z \in S$ to C if and only if there is an edge between x
and y, between x and z, and between y and z. The result is called the Vietoris–Rips complex
(at scale ε). The following was proved in [56].


Theorem 15.4 Let $S \subset X$ be an ε-sample, where $\varepsilon < \mathrm{wfs}(X)$. Let C be the Vietoris–Rips
complex above. Then, $H_0(X) = H_0(C)$ and $H_1(X) = H_1(C)$.
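The complex C and its first two Betti numbers are easy to compute for small samples. The following Python sketch builds C from the two edge conditions and the triangle rule, and then computes ranks of the boundary maps over the field with two elements. The function names are hypothetical, the use of coefficients in GF(2) is our simplification, and the test sample of 40 equally spaced points on the unit circle is chosen only for illustration.

```python
import math
from itertools import combinations

def rips_complex(points, eps):
    """Edges and triangles of the two-dimensional complex C at scale eps."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    edges = set()
    for i, j in combinations(range(n), 2):
        if dist[i][j] < 2 * eps:                         # condition 1
            edges.add((i, j))
        elif dist[i][j] < math.sqrt(8) * eps and any(    # condition 2
                dist[i][k] < 2 * eps and dist[j][k] < 2 * eps
                for k in range(n) if k != i and k != j):
            edges.add((i, j))
    triangles = [t for t in combinations(range(n), 3)
                 if all(e in edges for e in combinations(t, 2))]
    return sorted(edges), triangles

def rank_gf2(rows):
    """Rank over GF(2) of a matrix whose rows are encoded as integer bitmasks."""
    pivots, rank = {}, 0
    for row in rows:
        while row:
            high = row.bit_length() - 1
            if high in pivots:
                row ^= pivots[high]      # eliminate the leading bit
            else:
                pivots[high] = row
                rank += 1
                break
    return rank

def betti_0_1(n_vertices, edges, triangles):
    """Betti numbers b0 and b1 of C with coefficients in GF(2)."""
    edge_index = {e: k for k, e in enumerate(edges)}
    d1 = [(1 << i) | (1 << j) for i, j in edges]
    d2 = [sum(1 << edge_index[e] for e in combinations(t, 2)) for t in triangles]
    r1, r2 = rank_gf2(d1), rank_gf2(d2)
    return n_vertices - r1, len(edges) - r1 - r2

points = [(math.cos(2 * math.pi * j / 40), math.sin(2 * math.pi * j / 40))
          for j in range(40)]
edges, triangles = rips_complex(points, 0.2)
b0, b1 = betti_0_1(len(points), edges, triangles)
print(len(edges), len(triangles), b0, b1)
```

For this sample the complex has one connected component and one independent loop, matching the homology of the circle.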


Remark 15.5 Theorem 15.4 is based on [46, Theorem 1]. Here, Chazal and Lieutier proved
that, for $\varepsilon < \mathrm{wfs}(X)$, the tubular neighborhood $\bigcup_{x \in X} B_\varepsilon(x)$ is homotopy equivalent to X.


Topological data analysis is concerned with the homology of a data space X. One can
try to compute this from an ε-sample. The difficulty is to determine ε a priori. The input
is some representation of X. This could be a set of polynomial equations that define X.
The difficulty lies in the fact that, in order to choose ε reliably, we need to compute some
invariants of X, such as the reach $\tau(X)$, the local reach $\tau(s)$ at various points $s \in X$, the
k-bottlenecks and their widths, or the weak feature size wfs(X). All of these are computations
in metric algebraic geometry, based on what was explained in previous chapters.


15.2 Sampling with Density Guarantees 


This section presents two sampling algorithms that are guaranteed to compute an ε-sample
in a box. The input is a variety X in $\mathbb{R}^n$. Both algorithms compute a sample of $X \cap R$, where

$$ R \,=\, [a_1, b_1] \times \cdots \times [a_n, b_n] \,\subset\, \mathbb{R}^n. $$

If X is compact, then it is desirable to use a box R that contains the variety X. To compute
such an R, we first sample a point $u \in \mathbb{R}^n$ at random. Then, we compute the ED critical
points on X with respect to u; i.e., the critical points of the Euclidean distance function
$X \to \mathbb{R},\ x \mapsto d(x,u)$. From this, we infer $r := \max_{x \in X} d(x,u)$ and set R to be the box with
center u and side length 2r. Alternatively, we can maximize and minimize the n coordinate
functions over X. This will furnish a box with optimal values for $a_1, b_1, \ldots, a_n, b_n$.

The first sampling algorithm we present is due to Dufresne, Edwards, Harrington, and
Hauenstein [65]. Fix a box R. The basic idea for sampling from $X \cap R$ is to sample points
$u \in \mathbb{R}^n$ and then to collect the ED critical points with respect to u. The complexity of this
approach therefore depends on the Euclidean distance degree of the variety X.

The algorithm in [65] works recursively by dividing the edges of the box R in half,
thus splitting R into $2^n$ subboxes. In addition, one implements a database D that contains
information about all the regions in R that already have been covered by at least one sample
point. Algorithm 4 provides a complete description. Its correctness is proved in [65].


Theorem 15.6 (Theorem 4.4 in [65]) Algorithm 4 terminates with an ε-sample of $X \cap R$.


The second algorithm we present is due to Di Rocco, Eklund, and Gäfvert [56]. Their
algorithm is also based on computing ED critical points on X. But, in addition, it also
adds linear slices to the sampling. We need a few definitions. We set $d := \dim X$.
For every $1 \le k \le d$, denote by $\mathcal{T}_k$ the set of subsets of $\{1, \ldots, n\}$ with k elements.
Given $T = \{t_1, \ldots, t_k\} \in \mathcal{T}_k$, we let $V_T \subset \mathbb{R}^n$ be the k-dimensional coordinate plane
spanned by $e_{t_1}, \ldots, e_{t_k}$. For $\delta > 0$, we consider the grid

$$ G_T(\delta) \,:=\, \{\, \delta \cdot (a_1 e_{t_1} + \cdots + a_k e_{t_k}) \mid a_1, \ldots, a_k \in \mathbb{Z} \,\} \,\cong\, \delta \cdot \mathbb{Z}^k. $$

Let $\pi_T : \mathbb{R}^n \to V_T$ be the projection. The affine-linear spaces $\pi_T^{-1}(g)$ for $g \in G_T(\delta)$ specify
the faces of a cubical tessellation with side length δ.

Let B(X) be the width of the smallest bottleneck of X. The method from [56] is presented
in Algorithm 5. It takes as input a number $0 < \delta < \frac{1}{\sqrt{n}} \min\{\varepsilon, 2B(X)\}$ for the grid size.
Then, the sample is given by

$$ S_\delta \,:=\, \bigcup_{T \in \mathcal{T}_d} \; \bigcup_{g \in G_T(\delta)} X \cap \pi_T^{-1}(g). \qquad (15.2) $$

Algorithm 4: Finding ε-samples by ED optimization in subdivided boxes [65].

Input: A real algebraic variety $X \subset \mathbb{R}^n$, $\varepsilon > 0$, and a box $R = [a_1, b_1] \times \cdots \times [a_n, b_n]$.
Output: An ε-sample $S \subset X \cap R$.
Initialize $S = \emptyset$ and $D = \emptyset$. The set S will contain the sample points. The set D serves as a
    database containing balls in $\mathbb{R}^n$ that have already been covered in the process of the algorithm.
for each subbox R′ of R that is not yet covered do
    Compute the midpoint u of R′.
    Compute the real ED critical points $E \subset X$ with respect to u and compute
        $r := \min_{x \in X} \|x - u\|$.
    Add the points in E to S.
    Add $B_r(u)$ to D (this open ball does not contain any point in X, so we do not need to
        consider this region any further and can label it as being covered).
    Add $B_\varepsilon(y)$ for $y \in E$ to D.
    if the union of balls in D covers R′ then
        Label R′ and all of its subboxes as covered.
    else
        Split R′ into $2^n$ smaller subboxes.
    end
end
if all subboxes of R are labeled as covered then
    return S.
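To see the subdivision idea at work, here is a small Python sketch for the unit circle in the plane, where the ED critical points with respect to $u \ne 0$ are simply $\pm u/\|u\|$. It is a simplification of Algorithm 4 under stated assumptions, not the implementation of [65]: a subbox counts as covered only if it lies inside a single ball of the database (a sufficient condition that is easy to test on the four corners), the box is chosen so that no midpoint is the origin, and all names are hypothetical.

```python
import math

def ed_critical_points_circle(u):
    """Real ED critical points on the unit circle for a data point u != 0."""
    norm = math.hypot(u[0], u[1])
    return [(u[0] / norm, u[1] / norm), (-u[0] / norm, -u[1] / norm)]

def box_in_ball(box, center, radius):
    """A box lies in an open ball if and only if its four corners do."""
    (x0, x1), (y0, y1) = box
    return all(math.dist((x, y), center) < radius
               for x in (x0, x1) for y in (y0, y1))

def sample_circle(box, eps):
    """Subdivide the box until every subbox is inside one ball of the database."""
    sample, balls = [], []
    stack = [box]
    while stack:
        current = stack.pop()
        (x0, x1), (y0, y1) = current
        u = ((x0 + x1) / 2, (y0 + y1) / 2)           # midpoint of the subbox
        crit = ed_critical_points_circle(u)
        r = min(math.dist(u, y) for y in crit)        # distance from u to the circle
        sample.extend(crit)
        if r > 0:
            balls.append((u, r))                      # open ball without points of X
        balls.extend((y, eps) for y in crit)          # region within eps of the sample
        if any(box_in_ball(current, c, rad) for c, rad in reversed(balls)):
            continue                                  # subbox is covered
        mx, my = u                                    # otherwise split into 4 subboxes
        stack.extend([((x0, mx), (y0, my)), ((mx, x1), (y0, my)),
                      ((x0, mx), (my, y1)), ((mx, x1), (my, y1))])
    return sample

eps = 0.2
sample = sample_circle(((-1.5, 1.7), (-1.6, 1.4)), eps)
print(len(sample))
```

The loop stops once the half-diagonal h of a subbox is below ε/2: either the distance r from the midpoint to the circle exceeds h, so the empty ball covers the subbox, or r + h < ε, so the ε-ball around the nearest critical point does.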


Algorithm 5: Find an ε-dense sample with bottlenecks, grids, and ED methods [56].

Input: A real algebraic variety $X \subset \mathbb{R}^n$, a real number $\delta > 0$ with $\delta < \frac{1}{\sqrt{n}} \min\{\varepsilon, 2B(X)\}$, and
    a box R.
Output: An ε-dense sample $S \subset X \cap R$.
Initialize $S_\delta = \emptyset$ and $S'_\delta = \emptyset$.
Set $d := \dim X$.
for $T \in \mathcal{T}_d$ and $g \in G_T(\delta)$ do
    Compute $X \cap \pi_T^{-1}(g)$.
    Add the points in $X \cap \pi_T^{-1}(g)$ to $S_\delta$.
end
if d > 1 then
    for $1 \le k < d$ do
        for $T \in \mathcal{T}_k$ and $g \in G_T(\delta)$ do
            Sample a random point $u \in \mathbb{R}^n$.
            Compute $E(T, g, u)$, which are the ED critical points on $X \cap \pi_T^{-1}(g)$ with respect
                to u.
            Add the points in $E(T, g, u)$ to $S'_\delta$.
        end
    end
end
Set $S := S_\delta \cup S'_\delta$.
return S.


In words, the sample $S_\delta$ consists of the points that are obtained by intersecting the variety X
with the collection of affine-linear spaces $\pi_T^{-1}(g)$ that are indexed by $T \in \mathcal{T}_d$ and $g \in G_T(\delta)$.
The dimension of $\pi_T^{-1}(g)$ equals the codimension of X. To ensure transversal intersections,
we can always modify $G_T(\delta)$ by a random translation. If $d = \dim X > 1$, then we compute
an additional sample. First, we pick a random point $u \in \mathbb{R}^n$. Denote by $E(T, g, u)$ the ED
critical points on $X \cap \pi_T^{-1}(g)$ with respect to u. The additional sample is the set

$$ S'_\delta \,:=\, \bigcup_{k=1}^{d-1} \; \bigcup_{T \in \mathcal{T}_k} \; \bigcup_{g \in G_T(\delta)} E(T, g, u). \qquad (15.3) $$

The motivation for this extra sample is that $E(T, g, u)$ contains a point on every connected
component of the variety $X \cap \pi_T^{-1}(g)$. The algorithm is summarized in Algorithm 5. It
requires us to compute bottlenecks of X in order to specify the input. In the algorithm
itself, ED critical points must be found many times. Here, our Chapter 2 becomes relevant.


Theorem 15.7 (Theorem 4.6 in [56]) Algorithm 5 outputs an ε-sample of $X \cap R$.
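For a curve, only the first loop of Algorithm 5 is executed, and the sample consists of the intersection points with the grid lines. The Python sketch below illustrates this for the unit circle, where these intersections have closed form. It is an illustration under assumptions of our own: the grid is not randomly translated, the grid size is set to δ = ε/2 (the admissible range for δ is the one stated in the input of Algorithm 5), and the function names are hypothetical.

```python
import math

def grid_values(lo, hi, delta):
    """Grid points delta * k, with k an integer, inside the interval [lo, hi]."""
    return [delta * k
            for k in range(math.ceil(lo / delta), math.floor(hi / delta) + 1)]

def grid_slice_sample_circle(delta, box):
    """Intersect the unit circle with the lines x = g and y = g of the grid."""
    (x0, x1), (y0, y1) = box
    sample = []
    for g in grid_values(x0, x1, delta):      # T = {1}: vertical lines x = g
        if abs(g) <= 1:
            h = math.sqrt(1 - g * g)
            sample += [(g, h), (g, -h)]
    for g in grid_values(y0, y1, delta):      # T = {2}: horizontal lines y = g
        if abs(g) <= 1:
            w = math.sqrt(1 - g * g)
            sample += [(w, g), (-w, g)]
    return sample

eps = 0.2
delta = eps / 2
sample = grid_slice_sample_circle(delta, ((-1.5, 1.5), (-1.5, 1.5)))
print(len(sample))
```

The density of the output can be understood directly: every point of the circle lies in a grid cell of side δ, the circle must leave that cell through a grid line, and the exit point is a sample point at distance at most $\sqrt{2}\,\delta < \varepsilon$.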


We conclude this section with a third alternative for obtaining ε-samples. This comes
from the work of Niyogi, Smale, and Weinberger [138], which we already saw in Theorem 15.2.
Suppose that the variety X is compact, and suppose further that we are able to
sample from the uniform distribution on X. The algorithm simply consists of sampling k
i.i.d. points $S = \{x_1, \ldots, x_k\}$ from the uniform distribution on X.

The result in [138] implies that S is an ε-sample with high probability provided k is
large enough. More precisely, for a given $0 < \delta < 1$, they specify a required sample size k
such that the probability that S is an ε-sample for X is at least $1 - \delta$. That sample size k
depends on the reach and the volume of the variety X. The study and computation of these 
metric quantities is the main point of this book. We invite the reader to work through the 
previous chapters while taking a perspective towards random points, sampling, and its role 
in machine learning (cf. Chapter 10). This requires examining probability distributions on 
a real algebraic variety. We explain this in the next sections and we present algorithms for 
sampling from such probability distributions. 


15.3 Markov Chains on Varieties 


We will review one popular class of methods for sampling from probability distributions,
namely Markov Chain Monte Carlo (MCMC) methods. When the state space is finite,
this amounts to a random walk on a finite graph. Experts in algebraic statistics [167] are
familiar with MCMC methods that rest on Markov bases. These are used to carry out
Fisher’s Exact Test for conditional distributions; see [167, Chapter 9].

In this section, the situation is different because the state space X is continuous. As
before, we assume that X is a real algebraic variety in $\mathbb{R}^n$. Now, the description of Markov
chains requires a dose of analysis. In the following discussion, $\mathcal{A}$ denotes a σ-algebra
on X. Think of $\mathcal{A}$ as the collection of subsets of X that we can assign a measure to. The
elements in $\mathcal{A}$ are called measurable sets. If $\mu : \mathcal{A} \to \mathbb{R}$ is a measure and $f : X \to \mathbb{R}$ is a
measurable function, then we can integrate f against μ. This integral is written as

$$ \int_A f(x)\, \mu(dx) \quad \text{for } A \in \mathcal{A}. $$

The Lebesgue measure on X, which is induced from the ambient $\mathbb{R}^n$, is simply denoted
by dx. If μ is a probability measure with a probability density φ, then $\mu(dx) = \varphi(x)\,dx$.
For a measurable set $A \in \mathcal{A}$, let $\mathrm{vol}(A) = \int_A dx$ be the volume of A. Assume that X
is compact. Then $\mathrm{vol}(A) < \infty$ and the probability distribution with probability measure
$\mu(A) := \mathrm{vol}(A)/\mathrm{vol}(X)$ is called the uniform distribution on X.


Example 15.8 (n = 2) A sample of i.i.d. (independent and identically distributed) points
from the uniform distribution on the plane curve $X = \{x^4 + y^4 - 2x^2 - 2xy^2 - y + 1 = 0\}$
is shown in Figure 15.1. It is important that each connected component is covered. ⋄


We next discuss sampling from X by using MCMC methods. These methods set up
a Markov process on X. Let us recall some basic definitions from the theory of Markov
chains; see, e.g., [131, Chapter 3]. In this context, X is also called the state space. A Markov
kernel is a map $p : X \times \mathcal{A} \to [0,1]$ such that


1. $p(x, \cdot\,)$ is a probability measure for all $x \in X$;
2. $p(\,\cdot, A)$ is a measurable function for all $A \in \mathcal{A}$.


A stochastic process $x_0, x_1, x_2, x_3, \ldots$ on X is a sequence of random points on X.

A Markov process is a special type of stochastic process. We fix a point $x \in X$. A
(time-homogeneous) Markov process with starting point $x_0 = x$ is the stochastic process
where the probability that the next k points lie in certain measurable sets $A_1, \ldots, A_k$ is the
following iterated integral over the Markov kernel:

$$ \mathrm{Prob}(x_k \in A_k, \ldots, x_1 \in A_1 \mid x_0 = x) \,=\, \int_{y_1 \in A_1} \cdots \int_{y_{k-1} \in A_{k-1}} p(x, dy_1)\, p(y_1, dy_2) \cdots p(y_{k-1}, A_k). $$

A Markov process satisfies the Markov property: the probability law of the k-th state only
depends on the position of the (k − 1)-th state, but not on the earlier states $x_0, \ldots, x_{k-2}$.

In the discrete setting [167, Section 1.1], the Markov property is expressed as a conditional
probability, where one conditions on $x_i = z_i$ and the $z_i$ are fixed points. In our
continuous setting, this is not possible because it involves conditioning on an event of
probability zero. Instead, one has a more technical description. Let

$$ \mu^k(x, \cdot\,) : \; A \mapsto \mathrm{Prob}(x_k \in A \mid x_0 = x) \qquad (15.4) $$

denote the probability law of the k-th state. Then, the Markov property is

$$ \mathrm{Prob}(x_k \in A \mid x_{k-1}, \ldots, x_1, x_0 = x) \,=\, \mathrm{Prob}(x_k \in A \mid x_{k-1} = z) \,=\, p(z, A), \quad \text{where } z \sim \mu^{k-1}(x, \cdot\,). $$

This is found in the textbook by Meyn and Tweedie [131, Proposition 3.4.3]. The left-hand
side is a probability conditioned on the σ-algebra generated by the random variables $x_i$.
We will not discuss the technical details of the Markov property. Instead, we explain
what it means. If the (k − 1)-th state $x_{k-1}$ is at position z, then the probability of passing
from z to a k-th state $x_k \in A$ is given by the Markov kernel $p(z, A)$. As in the discrete
setting, the probability law of the state $x_k$ only depends on the position of $x_{k-1}$, but not
on the earlier states. Due to its role in the transition from one state to the next state, the
Markov kernel p is also called transition probability. Let us now see some examples.


Example 15.9 (Markov Chains in $\mathbb{R}^2$) For our state space, we take $X = \mathbb{R}^2$. We shall
present two examples of Markov chains in $\mathbb{R}^2$ starting at the origin. For the first, we take
$p_1(x, dy) = (2\pi)^{-1} \exp(-\tfrac{1}{2}\|y\|^2)\, dy$ as the kernel. The stochastic process arising from
this passes from the state $x \in \mathbb{R}^2$ to the next state by sampling a normal vector y in $\mathbb{R}^2$ with
covariance matrix the identity and mean value 0. The next state is then y. In particular, the
transition probability is entirely independent of the present state x.

Our second example has the kernel $p_2(x, dy) = (2\pi)^{-1} \exp(-\tfrac{1}{2}\|x - y\|^2)\, dy$. This is
more interesting than the previous process. In this Markov chain, passing from a state
$x \in \mathbb{R}^2$ to the next state y works by sampling a normal vector u in $\mathbb{R}^2$ with covariance
matrix the identity and mean value 0. The vector u is the step, so the next state is y = x + u.

We can simulate the first n = 10 steps in this Markov chain in Julia [20] as follows.

    x0 = [0.0, 0.0]           # starting point: the origin
    states = [x0]             # list of visited states
    n = 10                    # number of steps
    for k in 1:n
        x = states[k]         # current state
        y = x + randn(2)      # step by a standard normal vector u, so y = x + u
        push!(states, y)
    end


Figure 15.2 shows one realization of this chain. ⋄

Fig. 15.2: A sample of the first 10 steps of the Markov chain with transition probability at state x given by
sampling a normal vector in $\mathbb{R}^2$ with covariance matrix the identity and mean value x. See Example 15.9.


Our aim is to obtain Markov chains on an arbitrary real variety $X \subset \mathbb{R}^n$. Suppose that X
has dimension d and degree at least two. We can create a Markov chain on X using the
following simple geometric idea. Suppose that the chain is in state $x \in X$. We sample a random
linear space L of complementary dimension n − d passing through x. Then, we sample a
point uniformly at random from $X \cap L$. This is the next state in the Markov chain.
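The simplest instance of this idea is a plane curve, where L is a line. The Python sketch below runs such a chain on the unit circle, for which the second intersection point of a line through the current state has a closed form. This is our own toy illustration: the direction of the line is drawn uniformly (an assumption about how L is sampled), and the function name `step_on_circle` is hypothetical.

```python
import math
import random

def step_on_circle(p, rng):
    """One step: random line L through p, then a uniform choice from X ∩ L."""
    angle = rng.uniform(0.0, math.pi)            # direction of the line through p
    v = (math.cos(angle), math.sin(angle))
    # Points of L are p + t*v. Since |p| = |v| = 1, the equation |p + t*v|^2 = 1
    # has the roots t = 0 (the point p itself) and t = -2 <p, v>.
    t = -2.0 * (p[0] * v[0] + p[1] * v[1])
    other = (p[0] + t * v[0], p[1] + t * v[1])
    return rng.choice([p, other])                # uniform on the two points of X ∩ L

rng = random.Random(0)
states = [(1.0, 0.0)]
for k in range(10):
    states.append(step_on_circle(states[-1], rng))
print(states[-1])
```

Note that the current state p is itself one of the points of X ∩ L, so the chain stays in place with probability 1/2 in this example.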


Example 15.10 (Markov Chain on a Surface) Fix n = 3 and let X be the surface defined
by z − xy = 0. Given a point x on this surface, we sample the line L = {Ax = b} by
sampling A with Gaussian entries and then setting b = Ax. In addition, we only accept
states in the box R = [−8, 8] × [−8, 8] × [−64, 64]. The first few steps in this chain starting
at $x_0 = (0, 0, 0)$ are implemented in Julia [20] as follows.

    using HomotopyContinuation

    @var x y z
    f = System([z - x * y], variables = [x; y; z]);
    is_in_R(p) = abs(p[1]) < 8 && abs(p[2]) < 8 && abs(p[3]) < 64;
    n = 10;
    x0 = [0.0; 0.0; 0.0];
    states = [x0];
    for k in 1:n
        p = last(states);
        A = randn(2, 3); b = A * p;           # random line L = {Ax = b} through p
        L = LinearSubspace(A, b);
        S = solve(f, target_subspace = L);    # intersect the surface with L
        points = real_solutions(S);
        filter!(is_in_R, points);             # only accept states in the box R
        push!(states, rand(points));          # uniform choice among the accepted points
    end


Figure 15.3 shows a realization. We think of steps taken on the surface itself. The arrows 
are drawn curvy in order to highlight the nonlinear state space of this Markov chain. © 


Fig. 15.3: A few steps of the Markov chain from Example 15.10 on the surface with equation z — xy = 0. 


Let us now work towards sampling. We will recall results from the survey [153] by
Roberts and Rosenthal. A probability distribution π on X is called stationary for a Markov
kernel p if it satisfies the following linear equation:

$$ \int_{x \in X} p(x, A)\, \pi(dx) \,=\, \pi(A) \quad \text{for all } A \in \mathcal{A}. \qquad (15.5) $$

The idea of MCMC methods for sampling from a probability distribution π is to set up
a Markov process on X with stationary distribution π. The Markov chain starts at $x \in X$.
We want the probability measure $\mu^k(x, \cdot\,)$ of the k-th state, as defined in (15.4), to converge
to π as $k \to \infty$. Convergence is measured by the total variation distance. The total variation
distance between two measures μ and ν is

$$ d_{TV}(\mu, \nu) \,:=\, \sup_{A \in \mathcal{A}} |\mu(A) - \nu(A)|. $$

Thus, we aim to define a Markov process starting at an initial point $x \in X$ such that the
distribution of the k-th state converges to π as $k \to \infty$; i.e., $\lim_{k \to \infty} d_{TV}(\pi, \mu^k(x, \cdot\,)) = 0$.
The key properties for achieving this are irreducibility and aperiodicity.


Definition 15.11 A Markov chain with kernel p is called irreducible if, for all states x ∈ X
and all measurable sets A ∈ 𝒜 with vol(A) > 0, there exists k ∈ ℕ such that μ^k(x, A) > 0.


The interpretation of irreducibility is that all sets with positive volume will eventually
be reached by the Markov chain, and here the chain can start at any point x ∈ X.


Remark 15.12 The expression vol(A) in Definition 15.11 refers to the Lebesgue measure.
One can replace the Lebesgue measure with any other measure ν. In that case, one speaks
of ν-irreducible Markov chains.


Definition 15.13 Consider a Markov chain with kernel p and suppose that it has a stationary
distribution π. Let r ≥ 2. We call the chain periodic with period r if there exist pairwise
disjoint subsets A_1, …, A_r ∈ 𝒜 that satisfy the following two conditions for all indices i:


(a) π(A_i) > 0 and
(b) p(x, A_{i+1 mod r}) = 1 for all x ∈ A_i.


Otherwise, the chain is called aperiodic.


Aperiodicity means that the Markov process does not move periodically between the sets
A_1, …, A_r. The next lemma covers most of the Markov chains that arise from constructions
in algebraic geometry. Any Markov chain defined by a kernel p(x, A) = ∫_A φ(x, y) dy with
positive continuous probability density φ satisfies the hypothesis of the lemma.


Lemma 15.14 Consider a Markov chain with kernel p and stationary distribution π that
is absolutely continuous with respect to the Lebesgue measure. Suppose that the kernel is
positive, i.e., we have p(x, A) > 0 for all x ∈ X and A ∈ 𝒜 with vol(A) > 0. Then, the
Markov chain on the variety X corresponding to p is irreducible and aperiodic.


Proof For irreducibility, fix x ∈ X and a measurable set A ∈ 𝒜 with vol(A) > 0. Then,
μ^1(x, A) = Prob(x_1 ∈ A | x_0 = x) = p(x, A) > 0.


For aperiodicity, assume that there exist pairwise disjoint A_1, …, A_r ∈ 𝒜 with π(A_i) > 0
such that the Markov chain moves periodically between the A_i. Since π is absolutely
continuous with respect to the Lebesgue measure, we also have vol(A_i) > 0 for all i. Fix i
and let x ∈ A_i. Then, p(x, A_i) > 0. Since A_i and A_{i+1 mod r} are disjoint, this gives
p(x, A_{i+1 mod r}) ≤ 1 − p(x, A_i) < 1, which is a contradiction. □

For an irreducible and aperiodic Markov chain with stationary distribution, we have the
following convergence result [153, Theorem 4]. This is a continuous analogue of the
Perron–Frobenius theorem from linear algebra, which concerns distributions on finite state spaces.


Theorem 15.15 Consider an irreducible and aperiodic Markov chain with kernel p and
stationary distribution π. For almost all x ∈ X, it converges to π in total variation distance:

$$ \lim_{k \to \infty} d_{TV}(\pi, \mu^k(x, \cdot)) \;=\; 0. $$


Theorem 15.15 says that irreducible and aperiodic Markov chains converge to stationary 
distributions. See Sullivant’s book [167, Theorem 9.2.3] for the discrete case. The speed 
of convergence is called mixing time in the literature. For instance, a basic result on mixing 
times of Markov chains with continuous state space is [153, Theorem 8]. Roberts and 
Rosenthal [153] attribute this to Doeblin, Doob, and also to Markov. 
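
The discrete case is easy to try out numerically. The following sketch is our own toy illustration and is not taken from [153] or [167]: the 3 × 3 transition matrix P and the helper names tv_distance, stationary and tv_trajectory are assumptions made for this example. On a finite state space the supremum in the total variation distance is attained at the event {i : μ_i > ν_i}, so the distance equals half of the ℓ¹-distance between the two probability vectors.

```julia
using LinearAlgebra

# Total variation distance between two probability vectors on a finite set
# (half of the l1-distance, see the remark preceding this code).
tv_distance(mu, nu) = sum(abs.(mu .- nu)) / 2

# Stationary distribution of a row-stochastic matrix P:
# solve pi' * P = pi' together with the normalization sum(pi) = 1.
function stationary(P)
    n = size(P, 1)
    M = [P' - I; ones(1, n)]
    return M \ [zeros(n); 1.0]
end

# TV distances between the stationary distribution and the law of the
# k-th state, for k = 1, ..., steps, when the chain starts at state `start`.
function tv_trajectory(P, start, steps)
    target = stationary(P)
    mu = zeros(size(P, 1)); mu[start] = 1.0
    dists = Float64[]
    for _ in 1:steps
        mu = P' * mu                      # law of the next state
        push!(dists, tv_distance(target, mu))
    end
    return dists
end

# All entries are positive, so this chain is irreducible and aperiodic.
P = [0.5 0.3 0.2;
     0.2 0.6 0.2;
     0.1 0.4 0.5]
dists = tv_trajectory(P, 1, 30)
```

Plotting dists on a logarithmic scale shows how quickly the chain forgets its starting state; the decay rate is governed by the second largest eigenvalue modulus of P.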

For Markov chains arising in the context of algebraic varieties, for instance, the one
in Example 15.10 defined by linear sections, it is interesting to study and describe their
stationary distributions. However, it is not even clear whether they exist. In the next section,
we turn the perspective around. Our starting point will be a stationary distribution π, and
we will set up a Markov chain whose stationary distribution is π.


15.4 Chow goes to Monte Carlo 


The algebraic geometer Wei-Liang Chow obtained his doctoral degree from Leipzig University
in 1936. He is famous for fundamental contributions to the intersection theory of
algebraic varieties. The Chow form encodes a variety of dimension d by the linear subspaces
of codimension d + 1 that intersect it. We learn about Chow varieties and Chow polytopes in
the book by Gel’fand, Kapranov, and Zelevinsky [71]. Metric algebraic geometry suggests
that we take a look at objects named after Chow through the probabilistic lens.

In this section, we study the problem of sampling from a variety X using the Markov
Chain Monte Carlo (MCMC) paradigm. This involves intersecting X with random linear
spaces, whence our pointer to Chow. This gives rise to a class of algorithms, which we
name Chow–Markov Chain Monte Carlo (CMCMC).

Our point of departure is Theorem 15.15. This result has the following algorithmic
consequence. For sampling from a distribution π, we set up an irreducible and aperiodic
Markov chain with stationary distribution π. Then, if we let the Markov chain run long
enough, the points in the process will have a probability distribution close to π. This is the
idea underlying MCMC. The key task is thus to find and implement such a chain. We shall
explain how to find a Markov chain whose stationary distribution is π. One approach to
verifying that π is the stationary distribution is to show that the chain is reversible.


Definition 15.16 A Markov chain with kernel p is reversible with respect to π if

$$ \int_{x \in B} p(x, A)\, \pi(dx) \;=\; \int_{x \in A} p(x, B)\, \pi(dx) \quad \text{for all } A, B \in \mathcal{A}. $$




Lemma 15.17 If a Markov chain with kernel p is reversible with respect to a probability
distribution π, then π is a stationary distribution for that Markov chain.


Proof We have to check the equation in (15.5). Fix A ∈ 𝒜. Then

$$ \int_{x \in X} p(x, A)\, \pi(dx) \;=\; \int_{x \in A} p(x, X)\, \pi(dx) \;=\; \pi(A). $$

The first equality is Definition 15.16 applied with B = X. The second equality holds
since p(x, ·) is a probability measure, so that p(x, X) = 1 for all x ∈ X. □
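
On a finite state space, Lemma 15.17 can be checked by direct matrix computations. The sketch below is a toy illustration under our own assumptions: the target vector is arbitrary, and the kernel is built by proposing a state j uniformly and accepting it with probability min(1, target[j]/target[i]), staying at i otherwise. Reversibility then says that the matrix with entries target[i]·P[i, j] is symmetric, and stationarity is the discrete form of (15.5).

```julia
using LinearAlgebra

# A reversible kernel on the states 1..n for a positive target vector.
# Proposal: uniform on 1..n. Acceptance: min(1, target[j] / target[i]).
function metropolis_matrix(target)
    n = length(target)
    P = zeros(n, n)
    for i in 1:n, j in 1:n
        if i != j
            P[i, j] = (1 / n) * min(1, target[j] / target[i])
        end
    end
    for i in 1:n
        P[i, i] = 1 - sum(P[i, :])   # rejected proposals stay at state i
    end
    return P
end

target = [0.1, 0.2, 0.3, 0.4]
P = metropolis_matrix(target)

# Reversibility: target[i] * P[i, j] == target[j] * P[j, i] for all i, j.
flow = Diagonal(target) * P
is_reversible = isapprox(flow, flow')

# Stationarity: sum_i target[i] * P[i, j] == target[j] for all j.
is_stationary = isapprox(P' * target, target)
```

Summing the symmetric matrix flow over its rows or over its columns gives the same vector, which is exactly the argument in the proof above.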


Lemma 15.17 implies that the Metropolis–Hastings algorithm (see Algorithm 6 below)
creates a Markov chain with stationary distribution π. We refer to [153, Proposition 2],
and [167, Proposition 9.2.2] for the discrete case.

The Metropolis–Hastings algorithm works for a target distribution π that has a density φ.
The basic idea is to take another Markov chain whose kernel p(x, A) has a density q(x, y);
i.e., p(x, dy) = q(x, y) dy. The density q is called a proposal density. Sampling from the
proposal density creates a random proposal point y, which is either accepted or rejected
depending on how likely it is that the proposal point y was sampled from π. Notice that in
the algorithm we only need to evaluate φ(y) and q(x, y) up to scaling.


Algorithm 6: The Metropolis–Hastings algorithm.

Input: A probability measure π on X with density φ(y). A Markov kernel p(x, A) on X with
density q(x, y). A fixed starting point x ∈ X.
Output: A Markov chain on X with stationary distribution π.
 1 Set x_0 = x.
 2 for k = 0, 1, 2, … do
 3     Sample y ∼ p(x_k, ·).
 4     if φ(x_k) = 0 or q(x_k, y) = 0 then
 5         Set w(x_k, y) = 0.
 6     else
 7         Compute w(x_k, y) = min{1, (φ(y) · q(y, x_k)) / (φ(x_k) · q(x_k, y))}.
 8     end
 9     Sample a Bernoulli random variable β ∈ {0, 1} with Prob{β = 1} = w(x_k, y).
10     if β = 1 then
11         Set x_{k+1} := y.
12     else
13         Return to line 3.
14     end
15 end


Example 15.18 (The Symmetric Metropolis Algorithm) For a symmetric proposal density
q(x, y) = q(y, x), the Metropolis–Hastings algorithm (Algorithm 6) is called the Symmetric
Metropolis Algorithm. For instance, the Markov kernel from Example 15.9 with density
q(x, y) ∝ exp(−½ ‖x − y‖²) is symmetric. ⋄

Example 15.19 (Random Walks) We speak of a Random-Walk Metropolis–Hastings Algorithm
if the proposal density q(x, y) is a function of the difference x − y. An example for
this is the density from Example 15.9. ⋄


Example 15.20 (Independence Sampler) We call Algorithm 6 an independence sampler
if q(x, y) does not depend on x. In this case, samples from Algorithm 6 are independent. ⋄
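
To see the two proposal types of Examples 15.19 and 15.20 side by side, here is a minimal sketch on X = ℝ. Everything in it is an assumption made for illustration: the Gaussian target with mean 2, the proposal variances, and the function names. Note also that the sketch uses the common convention in which a rejected proposal repeats the current state in the chain, whereas line 13 of Algorithm 6 draws a new proposal instead.

```julia
using Random, Statistics

# One Metropolis-Hastings chain with n transitions.
#   phi(y)           unnormalized target density
#   propose(rng, x)  draws a proposal y given the current state x
#   q(x, y)          unnormalized proposal density of y given x
function metropolis_hastings(phi, propose, q, x0, n; rng = Random.default_rng())
    states = [x0]
    for _ in 1:n
        x = last(states)
        y = propose(rng, x)
        denom = phi(x) * q(x, y)
        # acceptance probability as in lines 4-8 of Algorithm 6
        w = denom == 0 ? 0.0 : min(1.0, phi(y) * q(y, x) / denom)
        # accept y with probability w; otherwise repeat the current state
        push!(states, rand(rng) < w ? y : x)
    end
    return states
end

# Target: Gaussian with mean 2 and variance 1, known only up to scaling.
phi(y) = exp(-(y - 2)^2 / 2)

# Random walk: q depends only on x - y (and is symmetric).
rw_propose(rng, x) = x + randn(rng)
rw_q(x, y) = exp(-(x - y)^2 / 2)

# Independence sampler: proposals from N(0, 9), so q does not depend on x.
ind_propose(rng, x) = 3 * randn(rng)
ind_q(x, y) = exp(-y^2 / 18)

rng = MersenneTwister(2023)
rw_chain = metropolis_hastings(phi, rw_propose, rw_q, 0.0, 20_000; rng = rng)
ind_chain = metropolis_hastings(phi, ind_propose, ind_q, 0.0, 20_000; rng = rng)
```

In both runs the sample mean should be close to the target mean 2. For the random walk the factors q(y, x) and q(x, y) cancel, which is the simplification behind the Symmetric Metropolis Algorithm of Example 15.18.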


The Metropolis—Hastings algorithm and its variants in the previous examples can be 
used for any sample space. We now turn to the setting of algebraic geometry, where our 
objects are algebraic varieties. 


Example 15.21 (Sampling points that are close to a variety) This follows the work of
Hauenstein and Kahle in [81]. We will see how Algorithm 6 can be used for sampling
points near a variety X ⊂ ℝⁿ. To this end, choose a box R ⊂ ℝⁿ and fix σ² > 0. Writing
d(·, ·) for the Euclidean distance, we consider the probability measure π on R given by

$$ \varphi(u) \;\propto\; \exp\Bigl( -\frac{1}{2\sigma^2}\, d(u, X)^2 \Bigr). $$

Sampling from π produces points that are likely to be close to X. Let X be the Trott
curve in (2.7) and let the box be R = [−2, 2] × [−2, 2]. We implement the Metropolis–Hastings
algorithm in Julia [20] where the proposal density is the standard Gaussian
q(x, y) ∝ exp(−½ ‖y‖²). This is an independence sampler (cf. Example 15.20). First, we
set up the polynomial system for computing d(u, X). We solve it for a general complex
point u that we will use later for running parameter homotopies.


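using HomotopyContinuation, LinearAlgebra

@var x[1:2] u[1:2] l
# equation of the Trott curve
f = 144 * (x[1]^4 + x[2]^4) - 225 * (x[1]^2 + x[2]^2) + 350 * x[1]^2 * x[2]^2 + 81;
df = differentiate(f, x);
# critical equations for the distance from u to X, with Lagrange multiplier l
F = System([f; df - l .* (x - u)], variables = [x; l], parameters = u);
# solve once for a random complex parameter
uC = randn(ComplexF64, 2);
SC = solve(F, target_parameters = uC)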


Next, we implement the densities q and φ. The variance is chosen to be σ² = 1/100. We
start out with a function that is a membership test for the box R.


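is_in_R(p) = abs(p[1]) < 2 && abs(p[2]) < 2;
q(u, v) = exp(-1/2 * norm(v)^2);
sigma_sq = 1/100;

function phi(u)
    # track the precomputed solutions from the parameter uC to the real point u
    S = solve(F, solutions(SC),
              start_parameters = uC,
              target_parameters = u,
              show_progress = false);
    # real critical points on X (the multiplier l is dropped)
    R = map(s -> s[1:2], real_solutions(S));
    # Euclidean distance from u to X
    d = minimum([norm(r - u) for r in R]);
    exp(-d^2 / (2 * sigma_sq));
end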


Finally, we run the Metropolis–Hastings algorithm for 10,000 steps:

n = 10000;
u0 = randn(2);
states = [u0];
for k in 1:n
    uk = last(states);
    v = randn(2);    # proposal from the standard Gaussian
    if is_in_R(v)
        a = min(1, phi(v) * q(v, uk) / (phi(uk) * q(uk, v)));
        b = rand();
        if a > b; push!(states, v); end    # accept v with probability a
    end
end
The result is shown in Figure 15.4. ⋄

Fig. 15.4: Two samples of points near the Trott curve, generated with the Metropolis–Hastings algorithm,
as explained in Example 15.21. The variance parameters are σ² = 1/100 (left) and σ² = 1/400 (right).


One difficulty with Algorithm 6 when sampling from a variety X is to identify a proposal
distribution with a density. Take, for instance, the Markov chain from Example 15.10, where
the next step x_{k+1} ∈ X is computed from the current step x_k by taking a random linear
space L through x_k and sampling uniformly from the points in X ∩ L. It is straightforward
to describe the generation of this random variable. But it is not clear how to compute its
density (or if such a density even exists). In such a scenario, we must prove that the chain is
reversible with respect to π. A further complication arises when X is not connected. In that
case, the proposals must be chosen in a way that no connected component of X is missed.

We shall present two approaches to Metropolis–Hastings that do not require the knowledge
of a proposal density. The first is the algorithm of Breiding and Marigliano [28], which
uses an independence sampler. Second, we consider the algorithm of Lelièvre, Stoltz, and
Zhang [123], which is based on a random walk. We attach the label CMCMC to both
algorithms, since they are based on linear slicings of the variety X.

The CMCMC algorithm in [28] rests on a Markov chain in the affine Grassmannian

$$ \mathrm{Gr}_a(c, \mathbb{R}^n) \;=\; \{ L \subset \mathbb{R}^n \mid L \text{ is an affine linear space of dimension } c \}, $$

where c = codim X. Our next theorem describes the density with which we sample a linear
space L ∈ Gr_a(c, ℝⁿ) (for instance, using Algorithm 6). As before, we assume that the
target distribution has a density φ(y). For a pair (A, b) ∈ ℝ^{d×n} × ℝ^d, we define

$$ \bar{\varphi}(A, b) \;:=\; \sum_{x \in X :\, Ax = b} \frac{\varphi(x)}{\alpha(x)}, \qquad \text{where} \qquad \alpha(x) \;:=\; \frac{\sqrt{1 + \langle x, P_x\, x \rangle}}{(1 + \|x\|^2)^{(d+1)/2}} \cdot \frac{\Gamma\bigl(\tfrac{d+1}{2}\bigr)}{\sqrt{\pi}^{\,d+1}}. $$

Here, d = dim X and P_x is the orthogonal projection onto the normal space N_x X. The
additional factor α is related to the change of variables when embedding ℝⁿ into the
n-dimensional real projective space.


Theorem 15.22 Let γ(A, b) be the probability density for which the entries of the pair
(A, b) ∈ ℝ^{d×n} × ℝ^d are i.i.d. standard Gaussian. The following formula defines a probability
density on the affine Grassmannian Gr_a(c, ℝⁿ):

$$ \psi(A, b) \;:=\; \frac{\gamma(A, b) \cdot \bar{\varphi}(A, b)}{\mathbb{E}_{(A,b) \sim \gamma}\, \bar{\varphi}(A, b)}. $$

The random linear space L := {x ∈ ℝⁿ | Ax = b} ∈ Gr_a(c, ℝⁿ) for (A, b) ∼ ψ satisfies:


(a) X ∩ L is finite with probability one.
(b) If we choose x in the finite set X ∩ L with probability φ(x)/(α(x) · φ̄(A, b)), then the
random point x is distributed according to the target density φ on the variety X.


Proof See [28, Theorem 1.1]. □


The algorithm in [123] is similar to the Metropolis–Hastings algorithm. The basic idea
is as follows. Suppose we want to sample from a target density φ(y) = exp(−V(y)).
Here, V(y) is a smooth function, called the potential function. Given a point x ∈ X, we
create a proposal distribution by sampling first in a random tangent direction v ∈ T_x X and
then computing the intersection of X with the random linear space

$$ L \;=\; x + v + N_x X \;\in\; \mathrm{Gr}_a(c, \mathbb{R}^n). $$


This creates a Markov chain on X. However, it is not clear how to compute the density for 
this random proposal. The authors of [123] prove directly the reversibility of their Markov 
chain, so they can apply Lemma 15.17. Algorithm 7 shows a version of their approach. 


Remark 15.23 In practice, the random tangent vector v in line 3 of Algorithm 7 can be
sampled as follows. Let U ∈ ℝ^{n×d} be a matrix whose columns form an orthonormal basis
of T_x X. Such a matrix can be computed with the Gram–Schmidt algorithm. If we sample
u ∈ ℝ^d with i.i.d. N(0, σ²)-entries, then we can take v = Uu.
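
The recipe of this remark can be sketched in a few lines for the surface z − xy = 0 from Example 15.10. The code is an illustration under our own assumptions: instead of Gram–Schmidt it calls nullspace from Julia's LinearAlgebra standard library, which also returns an orthonormal basis of the kernel of the Jacobian, and the names grad_f, tangent_basis and sample_tangent are hypothetical.

```julia
using LinearAlgebra, Random

# Gradient of f(x, y, z) = z - x*y. It spans the normal space N_p X.
grad_f(p) = [-p[2], -p[1], 1.0]

# Columns: an orthonormal basis of T_p X, the kernel of the 1 x 3 Jacobian.
tangent_basis(p) = nullspace(reshape(grad_f(p), 1, 3))

# Sample v = U*u, where u has i.i.d. N(0, sigma^2) entries.
function sample_tangent(p, sigma; rng = Random.default_rng())
    U = tangent_basis(p)                  # n x d matrix, here 3 x 2
    u = sigma .* randn(rng, size(U, 2))
    return U * u
end

p = [1.0, 2.0, 2.0]                       # a point on the surface: 2 - 1*2 = 0
v = sample_tangent(p, 0.5; rng = MersenneTwister(1))
```

Because the columns of U are orthonormal, v is Gaussian on T_p X with mean 0 and covariance σ² times the identity of the tangent space, as required in line 3 of Algorithm 7.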


The following theorem asserts that Algorithm 7 works correctly. For the proof we refer 
to [123, Theorem 1]. The second statement follows from Lemma 15.17. 


Theorem 15.24 Algorithm 7 yields a Markov chain on X that is reversible with respect to
the desired distribution π. In particular, the Markov chain has π as stationary distribution.

Algorithm 7: The CMCMC algorithm due to Lelièvre, Stoltz, and Zhang.

Input: A probability measure π on X with a density exp(−V(y)), where V is a smooth function
on X. A variance parameter σ² > 0. A fixed starting point x ∈ X.
Output: A Markov chain on X with stationary distribution π.
 1 Set x_0 = x.
 2 for k = 0, 1, 2, … do
 3     Draw a random tangent vector v ∈ T_{x_k} X by sampling v from the multivariate normal
       distribution on T_{x_k} X with mean 0 and covariance matrix σ² · I.
 4     Set L = x_k + v + N_{x_k} X.
 5     Sample a point y ∈ X ∩ L uniformly.
 6     Compute w ∈ T_y X such that x_k ∈ K := y + w + N_y X.
 7     Compute
       w(x_k, y) = min{1, (|X ∩ L| / |X ∩ K|) · exp(−(V(y) − V(x_k))) · exp(−(‖w‖² − ‖v‖²) / (2σ²))}.
 8     Sample a Bernoulli random variable β ∈ {0, 1} with Prob{β = 1} = w(x_k, y).
 9     if β = 1 then
10         Set x_{k+1} := y.
11     else
12         Return to line 3.
13     end
14 end
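
To get a feeling for Algorithm 7, it helps to run it on the simplest curve, the unit circle X ⊂ ℝ², where every step has a closed form. The sketch below is our own toy version under several assumptions, and it is not the implementation of [123]. The potential V(y) = −2·y_1 and all function names are made up. We read the acceptance probability in line 7 as the product of the ratio |X ∩ L| / |X ∩ K| with the two exponential factors; on the circle one has ‖w‖ = ‖v‖ and both slices have two points, so only the factor exp(−(V(y) − V(x_k))) matters. A rejected or empty proposal repeats the current state, which differs from line 12 of the algorithm.

```julia
using LinearAlgebra, Random, Statistics

# On the unit circle, T_x X is spanned by (-x[2], x[1]) and N_x X by x.
tangent(x) = [-x[2], x[1]]

# Points of X on the line x + v + N_x X = {s*x + v : s real}, for v in T_x X.
# Since v is orthogonal to x, the condition |s*x + v| = 1 reads s^2 = 1 - |v|^2.
function slice(x, v)
    r = 1 - dot(v, v)
    r <= 0 && return Vector{Float64}[]
    return [s * x + v for s in (sqrt(r), -sqrt(r))]
end

function cmcmc_circle(V, x0, sigma, n; rng = Random.default_rng())
    states = [x0]
    for _ in 1:n
        x = last(states)
        v = sigma * randn(rng) * tangent(x)   # line 3: Gaussian tangent vector
        pts = slice(x, v)                     # line 4: the points of X on L
        if isempty(pts)
            push!(states, x)                  # no proposal available: keep x
            continue
        end
        y = rand(rng, pts)                    # line 5: uniform choice
        w = x - dot(x, y) * y                 # line 6: projection of x - y onto T_y X
        back = slice(y, w)                    # the points of X on K
        accept = 0.0
        if !isempty(back)
            accept = min(1.0, length(pts) / length(back) *
                              exp(-(V(y) - V(x))) *
                              exp(-(dot(w, w) - dot(v, v)) / (2 * sigma^2)))
        end
        # normalize guards against round-off drifting away from the circle
        push!(states, rand(rng) < accept ? normalize(y) : x)
    end
    return states
end

V(y) = -2 * y[1]    # density exp(-V) is concentrated near the point (1, 0)
chain = cmcmc_circle(V, [1.0, 0.0], 0.5, 5000; rng = MersenneTwister(7))
```

The second point of each slice lies close to the antipode of the current state, so the chain can jump across the circle; for disconnected varieties such long-range proposals are what allows the sampler to reach other components.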


We have now reached the end of this book. Our presentation featured a wide range 
of tools and results from computational algebraic geometry that involve metric aspects of 
varieties. In addressing problems from applications, we are naturally led to cross paths with 
differential geometry, and — in this final chapter — also with probability theory. We learned 
that, in order to reliably sample from a variety X, it is essential to have a solid algebraic 
understanding of tools like ED degree, curvature, reach, medial axes, bottlenecks, etc. 